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HYPOXIA INDUCIBLE FACTOR-1 AND METHOD OF USE 

Statement as to Federally Sponsored Research 
This invention was made in part with funds from the Federal government, 
5 PHS grant R01-DK39869. The government therefore has certain rights in the 

invention. 

FIELD OF THE INVENTION 

This invention relates to hypoxia-related proteins, and specifically to novel 
DNA-binding proteins which are induced by hypoxia. 

10 Background of the Invention 

Mammals require molecular oxygen (0 2 ) for essential metabolic processes 
including oxidative phosphorylation in which 0 2 serves as electron acceptor during 
ATP formation. Systemic, local, and intracellular homeostatic responses elicited 
by hypoxia (the state in which 0 2 demand exceeds supply) include erythropoiesis 

15 by individuals who are anemic or at high altitude (Jelkmann (1992) Physiol. Rev. 

72:449-489), neovascularization in ischemic myocardium (White et al. (1992) Circ. 
Res. 71:1490-1500), and glycolysis in cells cultured at reduced O z tension (Wolfle 
et al. (1983) Eur. J. Biochem. 135:405-412). These adaptive responses either 
increase 0 2 delivery or activate alternate metabolic pathways that do not require 

20 0 2 . Hypoxia-inducible gene products that participate in these responses include 

erythropoietin (EPO) (reviewed in Semenza (1994) HematoL Oncol. Clinics N. 
Amer. 8:863-884), vascular endothelial growth factor (Shweiki et al. (1992) Nature 
359:843-845; Banai et al. (1994) Cardiovasc. Res. 28:1176-1179; Goldberg & 
Schneider (1994) J. Biol. Chem. 269:4355-4359), and glycolytic enzymes (Firth et 

25 al. (1994) Proc. Natl. Acad. Sci. USA 91:6496-6500; Semenza et al. (1994) J. 

Biol. Chem. 269:23757-23763). 

The molecular mechanisms that mediate genetic responses to hypoxia have 
been extensively investigated for the EPO gene, which encodes a growth factor 
that regulates erythropoiesis and thus blood 0 2 -carrying capacity (Jelkmann 

30 (1992) supra : Semenza (1994) supra) . C/s-acting DNA sequences required for 

transcriptional activation in response to hypoxia were identified in the EPO 
3'-flanking region and a frans-acting factor that binds to the enhancer, 
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hypoxia-inducible factor 1 (HIF-I), fulfilled criteria for a physiological regulator of 
EPO transcription: inducers of EPO expression (1% 0 2l cobalt chloride [CoCIJ, 
and desferoxamine [DFX]) also induced HIF-I DNA binding activity with similar 
kinetics; inhibitors of EPO expression (actinomycin D, cycloheximide, and 
5 2-aminopurine) blocked induction of HIF-I activity; and mutations in the EPO 

3'-flanking region that eliminated HIF-I binding also eliminated enhancer function 
(Semenza (1994) supra). These results also support the hypothesis that O z 
tension is sensed by a hemoprotein (Goldberg et al. (1988) Science 
242:1412-1415) and that a signal transduction pathway requiring ongoing 
10 transcription, translation, and protein phosphorylation participates in the induction 
of HiF-1 DNA-binding activity and EPO transcription in hypoxic cells (Semenza 
(1994) supra). 

EPO expression is cell type specific, but induction of HIF-1 activity by 1% 0 2 , 
CoCI 2 , or DFX was detected in many mammalian cell lines (Wang & Semenza 

15 (1993a) Proc. Natl. Acad. Sci. USA 90:4304-4308), and the EPO enhancer 

directed hypoxia-inducible transcription of reporter genes transfected into 
non-EPO-producing cells (Wang & Semenza (1993a) supra : Maxwell et al. (1993) 
Proc. Natl. Acad. Sci. USA 90:2423-2427). RNAs encoding several glycolytic 
enzymes were induced by 1% 0 2 , CoCI 2 , or DFX in EPO-producing Hep3B or 

20 non-producing HeLa cells whereas cycloheximide blocked their induction and 

glycolytic gene sequences containing HIF-I binding sites mediated 
hypoxia-inducible transcription in transfection assays (Firth et al. (1994) supra : 
Semenza et al. (1994) suprg). These experiments support the role of HIF-1 in 
activating homeostatic responses to hypoxia. 

25 SUMMARY OF THE 1NVFMTIOM 

The invention features a substantially purified DNA-binding protein, hypoxia- 
inducible factor-1 (HIF-1), characterized as activating structural gene expression 
where the promoter region of the structural gene contains an HIF-1 binding site. 
Examples of such structural genes include erythropoietin (EPO), vascular 
30 endothelial growth hormone (V-EGF), and glycolytic genes. HIF-1 is composed of 
two subunits, HIF-1ct and an isoform of HIF-1 B. 

The invention features a substantially purified HIF-1 a polypeptide, and a 
nucleotide sequence which encodes HIF-1 a. 
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The invention provides methods for preventing and treating hypoxia-reiated 
disorders, including tissue damage resulting from hypoxia and reperfusion, by 
administering a therapeutically effective amount of HIF-1 protein. Also included in 
the invention is gene therapy by introducing into cells a nucleotide sequence 
encoding HIF-1. The invention also provides a pharmaceutical composition 
comprising a pharmaceutical^ acceptable carrier admixed with a therapeutically 
effective amount of HIF-1 or nucleotide sequence encoding HIF-1. 

The invention further provides a novel HIF-1 a variant polypeptide which 
functionally inactivates HIF-1 in vivo. The invention provides a method for treating 
an HIF-1 -mediated disorder or condition by functional inactivation of HIF-1 by 
administration of an effective amount of the HIF-1 a variant of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a autoradiograph showing dose-dependent induction of HIF-1 DNA 
binding activity by CoCI 2 treatment. Nuclear extracts, prepared from HeLa cells 
cultured in the presence of the 0, 5, 10, 25, 50, 75, 100, 250, 500, or 1000 uM of 
CoCI 2 for 4 h at 37oC, were incubated with W18 probe and analyzed by gel shift 
assay. Lanes 1-8 and 9-12 represent extracts prepared in two separate 
experiments. Arrows indicate HIF-1, constitutive DNA binding activity (C), 
nonspecific activity (NS), and free probe (F). 

Fig. 2 is an autoradiograph showing the results of methylation interference 
analysis with nuclear extracts from CoCI 2 -treated HeLa cells. W18 was 5'-end 
labeled on the coding or noncoding strand, partially methylated, and incubated 
with nuclear extracts. DNA-protein complexes corresponding to HIF-1 , 
constitutive DNA binding activities (C1 and C2), and nonspecific binding activity 
(NS) were isolated from a preparative gel shift assay (lower) in addition to free 
probe (F) (not shown). DNA was purified, cleaved with piperidine, and analyzed 
on a 15% denaturing polyacrylamide gel (upper). Results are summarized at left 
for coding strand and at right for noncoding strand. The guanine residues are 
numbered according to their locations on the W18 probe. The HIF-1 binding site 
is boxed. Complete methylation interference with HIF-1 binding is indicated in 
closed circles; partial and complete methylation interference with constitutive DNA 
binding activity are indicated by open and closed squares, respectively. 
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Fig. 3A is an autoradiograph showing gel shift assay analysis of column 
fractions for HIF-1 DNA binding activity. Nuclear extracts were fractionated by 
DEAE-Sepharose chromatography, and fractions containing HIF-1 activity were 
applied to a W18 DNA affinity column. 5 ug of protein were incubated with 0.1 ug 
of calf thymus DNA for gel shift analysis of crude nuclear extract (Crude NE, lane 
1) and HIF-1 active fractions from DEAE-Sepharose columns (DEAE, lane 2). For 
fractions from the W18 column (lanes 3-13), 1 ul aliquots were incubated with 5 
ng of calf thymus DNA. The positions of the two HIF-1 bands, constitutive activity 
(C). nonspecific activity (NS), and free probe (F) are indicated. FT, flowthrough, 
0.25 M, 0.5 M, 1 M, and 2 M are fractions eluted with indicated concentration of 
KCI in buffer Z. 

Fig. 3B is an autoradiograph showing sequence-specific DNA binding of the 
partially purified fractions described in the legend to Fig. 3A. 5 ug aliquots of 
fractions from the DEAE-Sepharose column were incubated with W18 probe in 
the presence of no competitor (lane 1), 10-fold (lanes 2 and 5), 50-fold (lanes 3 
and 6), or 250-fold (lanes 4 and 7) molar excess of unlabeled W18 (W, lanes 2-4) 
or M18 (M, lanes 5-7) oligonucleotide. 

Fig. 4A is an autoradiograph showing purification of HIF-1 from CoCI 2 -treated 
HeLa S3 cells. Flowthrough fraction from the M18 DNA column (Load, lane 1) 
and 0.25 M KCI and 0.5 M KCI fractions from the second W1 8 DNA affinity 
column (lanes 2 and 3) were analyzed. An aliquot of each fraction (5 ug of load 
or 1 ug of affinity column fractions) were resolved by 6% SDS-PAGE and silver 
stained. HIF-1 polypeptides in lanes 2 and 3 are indicated by arrows at the right 
of the figure. 

Fig. 4B is an autoradiograph showing HIF-1 purification from hypoxic Hep3B 
cells. HIF-1 fractions from the first W18 column (Load, lane 1) and 0.25 M KCI 
and 0.5 M KCI fractions from the second W18 column (lanes 2 and 3) were 
analyzed. An aliquot of each fraction (50 ul) was resolved by 7% SDS-PAGE and 
silver stained. Molecular mass markers are myosin (200 kDa), p-galactosidase 
(116 kDa), phosphorylase (97 kDa), BSA (66 kDa), and ovalbumin (45 kDa). HIF- 
1 polypeptides in lanes 2 and 3 are indicated by arrows at the right of the figure. 
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Fig. 5A is an autoradiograph identifying the HIF-1 polypeptides. An aliquot of 
affinity-purified HIF-1 was resolved on a 6% SDS-polyacrylamide gel with 3.2% 
cross-linking along with the HIF-1 protein complex isolated by preparative native 
gel shifl assay (HIF-1). MW, molecular mass markers with size (kDa) indicated at 
left of figure; numbers to the right of figure indicate the apparent molecular 
weights (kDa) of HIF-1 polypeptides. 

Fig. 5B is an autoradiograph showing the HIF-1 components on a 6% SDS- 
polyacrylamide gel with 5% cross-linking. An aliquot of affinity-purified HIF-1 was 
resolved on a 6% SDS-polyacrylamide gel along with the HIF-1 protein complex 
isolated by preparative native gel shift assay (HIF-1). The 120 kDa polypeptide, 
94/93/91 kDa polypeptides, and two contaminant proteins (*1 and *2) are 
indicated. 

Fig. 5C is an autoradiograph showing the alignment of HIF-1 components 
identified on two gel systems with different degrees of cross-linking. Gel slices 
isolated from the 6% SDS-polyacrylamide gel with 5% cross-linking corresponding 
to 120 kDa HIF-1 polypeptide (12), 94/93/91 kDa HIF-1 polypeptide (94/93/91), 
and two contaminant proteins (*1 and *2) were resolved on a 6% SDS- 
polyacrylamide gel with 3.2% cross-linking in parallel with an aliquot (30 ul) of 
affinity purified HIF-1 (Fig. 5A). 

Fig. 6 is a graph of the absorbance profiles at 215 nm of tryptic peptides 
derived from 91 kDa HIF-1 polypeptide (top), 93/94 kDa polypeptides (middle), 
and trypsin (bottom). 

Fig. 7 is an autoradiograph showing UV cross-linking analysis with affinity 
purified HIF-1 and probe W18 in the absence (lane 1) or presence of 250-fold 
molar excess of unlabeled W18 (lane 2) or M18 (lane 3) oligonucleotide. The 
binding reaction mixtures were UV-irradiated and analyzed on a 6% SDS- 
polyacrylamide gel. Molecular mass standards are indicated at left. 

Fig. 8 is an autoradiograph showing the results of glycerol gradient 
sedimentation analysis. Nuclear extracts prepared from Hep3B cells exposed to 
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1% 0 2 for 4 h (Load) was seclimented through a 10-30% linear glycerol gradient. 
Aliquots (10 ul) from each fraction were analyzed by gel shift assay. Arrows at 
top indicate the peak migration for ferritin (440 kDa) t catalase (232 kDa), aldolase 
(158 kDa), and BSA (67 kDa). 

5 FIG. 9 is a diagram of the cDNA sequence encoding HIF-la. Bold lines 

indicate extent of clones hbc120, hbc025 ( and 3.2-3 relative to the full-length 
RNA-coding sequence shown below. Box, amino acid coding sequences; thin 
line, untranslated sequences; bHLH, basic helix-loop-helix domain; A and B, 
internal homology units within the PAS domain. 

10 Fig. 10 is the nucleotide and derived amino acid sequence of HIF-la. A 

composite sequence was derived from the complete nucleotide sequences 
determined for clones 3.2-3 (nt 1-3389), hbc025 (nt 135-3691), and hbc120 (nt 
1739-3720). Sequences of four tryptic peptides obtained from the purified HIF-la 
120 kDa polypeptide are underscored (two peptides are contiguous). 

15 Fig. 1 1 is the analysis of bHLH domains. Coordinate of first residue of each 

sequence and amino acid identity with HIF- 1a or HIF- ip (ARNT) are given in 
parentheses at left and right margins, respectively. Hyphen indicates gap 
introduced into sequence to maximize alignment except in consensus where it 
indicates a lack of agreement. Consensus indicates at least 3 proteins with 

20 identical or similar residue at a given position. 1 : F, I, L, M, or V; 2: S or T; 3: D or 

E; 4: K or R. Invariant residues are shown in bold. 

Fig. 12 is the analysis of PAS domains. Alignments of PAS A (top) and B 
(bottom) subdomains are shown. Consensus indicates at least 4 proteins with 
identical or similar residue at a given position. GenBank accession numbers: 
25 ARNT, M69238; AHR, L19872; SIM, M19020; Ml, Z23066; USF, X55666; L-MYC, 

X13945; CP-1, M34070; PER, M30114; KinA, M31067. 

Fig. 13A is an autoradiograph showing HIF-1a and HIF-13 RNA expression 
after exposure of Hep3B cells to 1% 0 2 for 0, 1 , 2, 4, 8, and 16 h. 
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Fig. 13B is an autoradiograph showing HIF-1ct and HIF-1p RNA expression 
after exposure of Hep3B cells to 75 uM CoCI 2 for 0, 1, 2, 4, 8, and 16 h. 

Fig. 13C is an autoradiograph showing HIF-1a and HIF-1p RNA expression 
after exposure of Hep3B cells to 130 uM desferoxamine (DFX) for 0, 1, 2, 4, 8, 
5 and 16 h. 

Fig. 13D is an autoradiograph showing HIF-1a and H1F-10 RNA expression 
after exposing Hep3B cells to 1% 0 2 for 4 h, then returning the cells to 20% 0 2 for 
0 t 5, 15, 30, or 60 min prior to RNA isolation. 

Fig. 13E is a table of the AUUUA-containing elements from the HIF-1a 3- 
10 UTR. The first nucleotide is numbered according to the composite cDNA 

sequence. 

Fig. 14A is an autoradiograph of nuclear extracts from hypoxic Hep3B cells 
incubated with oligonucleotide probe W18 for 10 min on ice, immune sera was 
added (lanes 2 and 5) and incubated for 20 min on ice, followed by 
15 polyacrylamide gel electrophoresis. Preimmune sera (lanes 3 and 5) and antisera 

(lanes 2 and 4) were obtained from rabbits before and after immunization, 
respectively, with GST/HIF-1a (lanes 2 and 3) or GSTYHIF-1p (lanes 4 and 5). 
HIF-1, constitutive (C) and nonspecific (NS) DNA binding activities, free probe (F), 
and supershifted HIF-1 /DNA/antibody complex (S) are indicated. 

20 F»g- 14B is an immunoblot showing antisera recognition of HIF-1 subunits 

present in purified protein preparations and crude protein extracts. Nuclear 
extracts from Hep3B cells which were untreated (lane 1) or exposed to 1% 0 2 for 
4 h (lane 2) and from HeLa cells which were untreated (lane 6) or exposed to 75 
uM CoCI 2 for 4 h (lane 7) were fractionated on a 6% SDS/polyacrylamide gel in 

25 parallel with 1, 2, and 5 ul of affinity-purified HIF-1 from CoCI 2 -treated HeLa cells 

(lanes 3-5). Protein was transferred to a nitrocellulose membrane and incubated 
with antisera to HIF-1 a (top) or HIF-1 p (bottom). 
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Fig. 14C is an immunoblot showing the induction kinetics of HIF-1a and HIF- 
13 protein in hypoxic cells. Hep3B cells were exposed to 1% 0 2 for 0 to 16 h prio 
to preparation of nuclear (N.E.) and cytoplasmic (C.E.) extracts, and immunoblot 
analysis was performed with antisera to HlF-1a (top) or HIF-1 3 (bottom). 

Fig. 14D is an immunoblot showing decay kinetics of HIF-1a and HIF-1 3 
polypeptides in post-hypoxic cells. Hep3B cells were exposed to 1% 0 2 for 4 h 
and returned to 20% 0 2 for 0 to 60 min prior to preparation of extracts and 
immunoblot analysis. Arrowheads distinguish HIF-1 subunits from cross-reacting 
proteins of unknown identity. 



Fig. 15A is an diagram of the structure of reporter gene constructs used for 
functional analysis of HIF-1 binding sites in human aldolase A (hALDA), human 
phosphoglycerate kinase 1 (hPGK1), and mouse phosphofructokinase L (mPFKL) 
genes. Arrow, transcription initiation site; box, hEPO 3'-FS (cross-hatched), 
hPGK1 5'-FS (stippled), or mPFKL IVS-1 (striped) oligonucleotide (sequences are 
as shown in Table 3). DNA fragments from the 5'-end of the hALDA gene in 
pNMHcat and pHcat are 3.5 and 0.76 kb, respectively, and are colinear at the 3 1 - 
end where they are directly fused to CAT coding sequences. 

Fig. 15B is a bar graph showing CAT/B-galactosidase expression (relative 
CAT activity) in transfected cells exposed to 20% 0 2 (open bar) or 1% 0 2 (closed 
bar). Data are plotted using lower scale for all results except those for pHcat, 
which are plotted according to the upper scale. Induction, representing the 
relative CAT activity at 1% O 2 /20%O 2 , was calculated for each experiment; mean 
and standard error of mean (SEM) were determined for results from n 
independent experiments. 



Fig. 16 is the amino-terminal (top) and carboxy-terminal (bottom) amino acid 
sequence of the wild-type and dominant-negative variant forms of HIF-1 a. 

DETAILED DESCRI PTION OF THE INVENTION 

The invention provides a substantially pure hypoxia-inducible factor-1 (HIF-1) 
characterized as a DNA-binding protein which binds to a region in the regulatory, 
preferably in the enhancer region, of a structural gene having the HIF-1 binding 
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motif. Included among the structural genes which can be activated by HIF-1 are 
erythropoietin (EPO), vascular endothelial growth factor (VEGF), and glycolytic 
gene transcription in cells subjected to hypoxia. Analysis of purified HIF-1 shows 
that it is composed of subunits HIF-1 a and an isoform of HIF-1 p. In addition to 
having domains which allow for their mutual association in forming HIF-1, the a 
and 3 subunits of HIF-1 both contain DNA-binding domains. The alpha subunit is 
uniquely present in HIF-1, whereas the beta subunit (ARNT) is a component of at 
least two other transcription factors. 

The invention provides a substantially pure hypoxia-inducible factor-1a (HIF- 
1a) polypeptide characterized as having a molecular weight of 120 kDa as 
determined by SDS-PAGE and having essentially the amino acid sequence of 
SEQ ID NO:2 (Fig. 1 0) and dimerizing to HIF-1 p to form HIF-1 . The term 
"substantially pure" as used herein refers to HIF-1 a which is substantially free of 
other proteins, lipids, carbohydrates or other materials with which it is naturally 
associated. One skilled in the art can purify HIF-1 a using standard techniques for 
protein purification. The substantially pure polypeptide will yield a single band on 
a non-reducing polyacrylamide gel. The purity of the HIF-1a polypeptide can also 
be determined by amino-terminal amino acid sequence analysis. HIF-1 a protein 
includes functional fragments of the polypeptide, as long as the activity of HIF-1a, 
such as the ability to bind with HIF-1 p, remains. Smaller peptides containing the 
biological activity of HIF-1 a are included in the invention. 

The invention provides nucleotide sequences encoding the HIF-1 a 
polypeptide (SEQ ID NO:1)(Fig. 10). These nucleotides include DNA, cDNA, and 
RNA sequences which encode HIF-1 a. It is also understood that all nucleotide 
sequences encoding all or a portion of HIF-1 a are also included herein, as long 
as they encode a polypeptide with HIF-1 a activity. Such nucleotide sequences 
include naturally occurring, synthetic, and intentionally manipulated nucleotide 
sequences. For example, HIF-1 a nucleotide sequences may be subjected to 
site-directed mutagenesis. The nucleotide sequence for HIF-1 a also includes 
antisense sequences. The nucleotide sequences of the invention include 
sequences that are degenerate as a result of the genetic code. All degenerate 
nucleotide sequences are included in the invention as long as the amino acid 
sequence of HIF-1 a polypeptide which is encoded by the nucleotide sequence is 
functionally unchanged. 
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Specifically disclosed herein is a DNA sequence encoding the human HIF-1a 
gene. The sequence contains an open reading frame encoding a polypeptide 826 
amino acids in length. The human HIF-1a initiation methionine codon shown in 
FIG. 10 at nucleotide position 29-31 is the first ATG codon following the in-frame 
stop codon at nucleotides 2-4. Preferably, the human HIF-1a amino acid 
sequence is SEQ ID NO:2. 

The nucleotide sequence encoding HIF-1a includes SEQ ID NO:1 as well as 
nucleic acid sequences complementary to SEQ ID NO:1 . A complementary 
sequence may include an antisense nucleotide. When the sequence is RNA, the 
deoxynucleotides A, G, C. and T of SEQ ID NO:2 are replaced by ribonucleotides 
A, G, C, and U, respectively. Also included in the invention are fragments of the 
above-identified nucleic acid sequences that are at least 15 bases in length, 
which is sufficient to permit the fragment to selectively hybridize to DNA or RNA 
that encodes the polypeptide of SEQ ID NO:2 under physiological conditions. 
Specifically, the fragments should hybridize to DNA or RNA encoding HIF-1a 
protein under stringent conditions. 

Minor modifications of the HIF-1a primary amino acid sequence may result in 
proteins which have substantially equivalent activity as compared to the HIF-1a 
polypeptide described herein. Such proteins include those as defined by the term 
"having essentially the amino acid sequence of SEQ ID NO:2". Such 
modifications may be deliberate, as by site-directed mutagenesis, or may be 
spontaneous. All of the polypeptides produced by these modifications are 
included herein as long as the biological activity of HIF-1a still exists. Further, 
deletions of one or more amino acids can also result in modification of the 
structure of the resultant molecule without significantly altering its biological 
activity. This can lead to the development of a smaller active molecule which 
would have broader utility. For example, one can remove amino or carboxy 
terminal amino acids which are not required for HIF-1a biological activity. 

The HIF-1cc polypeptide of the invention encoded by the nucleotide sequence 
of the invention includes the disclosed sequence (SEQ ID NO:2) and conservative 
variations thereof. The term "conservative variation" as used herein denotes the 
replacement of an amino acid residue by another, biologically similar residue. 
Examples of conservative variations include the substitution of one hydrophobic 
residue such as isoleucine, valine, leucine, or methionine for another, or the 
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substitution of one polar residue for another, such as the substitution of arginine 
for lysine, glutamic acid for aspartic acid, or glutamine for asparagine, and the 
like. The term "conservative variation" also includes the use of a substituted 
amino acid in place of an unsubstituted parent amino acid provided that 
5 antibodies raised to the substituted polypeptide also immunoreact with the 

unsubstituted polypeptide. 

The DNA sequences of the invention can be obtained by several methods. 
For example, the DNA can be isolated using hybridization techniques which are 
well known in the art. These include, but are not limited to: 1) hybridization of 
10 genomic or cDNA libraries with probes to detect homologous nucleotide 

sequences, 2) polymerase chain reaction (PCR) on genomic DNA or cDNA using 
primers capable of annealing to the DNA sequence of interest, and 3) antibody 
screening of expression libraries to detect cloned DNA fragments with shared 
structural features. 

15 Preferably the HIF-1a nucleotide sequence of the invention is derived from a 

mammalian organism, and most preferably from human. Screening procedures 
which rely on nucleic acid hybridization make it possible to isolate any gene 
sequence from any organism, provided the appropriate probe is available. 
Oligonucleotide probes, which correspond to a part of the sequence encoding the 

20 protein in question, can be synthesized chemically. This requires that short, 

oligopeptide stretches of amino acid sequences must be known. The DNA 
sequence encoding the protein can be deduced from the genetic code, however, 
the degeneracy of the code must be taken into account. It is possible to perform 
a mixed addition reaction when the sequence is degenerate. This includes a 

25 heterogeneous mixture of denatured double-stranded DNA. For such screening, 

hybridization is preferably performed on either single-stranded DNA or denatured 
double-stranded DNA. Hybridization is particularly useful in the detection of 
cDNA clones derived from sources where an extremely low amount of mRNA 
sequences relating to the polypeptide of interest are present. In other words, by 

30 using stringent hybridization conditions directed to avoid non-specific binding, it is 

possible, for example, to allow the autoradiographic visualization of a specific 
cDNA clone by the hybridization of the target DNA to that single probe in the 
mixture which is its complete complement (Sambrook et al. (1989) Molecular 
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Cloning: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, 
Plainview, NY). 

The development of specific DNA sequences encoding HIF-1d can also be 
obtained by: 1) isolation of double-stranded DNA sequences from the genomic 
5 DNA; 2) chemical manufacture of a DNA sequence to provide the necessary 

codons for the polypeptide of interest; and 3) in vitro synthesis of a 
double-stranded DNA sequence by reverse transcription of mRNA isolated from a 
eukaryotic donor cell. In the latter case, a double-stranded DNA complement of 
mRNA is eventually formed which is generally referred to as cDNA. Of the three 

10 above-noted methods for developing specific DNA sequences for use in 

recombinant procedures, the isolation of genomic DNA isolates is the least 
common. This is especially true when it is desirable to obtain the microbial 
expression of mammalian polypeptides due to the presence of introns. 

The synthesis of DNA sequences is frequently the method of choice when the 

15 entire sequence of amino acid residues of the desired polypeptide product is 

known. When the entire sequence of amino acid residues of the desired 
polypeptide is not known, the direct synthesis of DNA sequences is not possible 
and the method of choice is the synthesis of cDNA sequences. Among the 
standard procedures for isolating cDNA sequences of interest is the formation of 

20 plasmid- or phage-carrying cDNA libraries which are derived from reverse 

transcription of mRNA which is abundant in donor cells that express the gene of 
interest at a high level. When used in combination with polymerase chain 
reaction technology, even rare expression products can be cloned. In those 
cases where significant portions of the amino acid sequence of the polypeptide 

25 are known, the production of labeled single or double-stranded DNA or RNA 

probe sequences duplicating a sequence putatively present in the target cDNA 
may be employed in DNA/DNA hybridization procedures which are carried out on 
cloned copies of the cDNA which have been denatured into a single-stranded 
form (Jay et al. (1983) Nucl. Acid Res., 1 1:2325). 

30 A cDNA expression library, such as lambda gt1 1 , can be screened indirectly 

for HIF-1cc peptides having at least one epitope, using antibodies specific for HIF- 
1a. Such antibodies can be either polyclonally or monoclonally derived and used 
to detect expression product indicative of the presence of HIF-1a cDNA. 
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DNA sequences encoding HIF-1ct can be expressed in vitro by DNA transfer 
into a suitable host cell. "Host cells" are cells in which a vector can be 
propagated and its DNA expressed. The term also includes any progeny of the 
subject host cell. It is understood that all progeny may not be identical to the 
5 parental cell since there may be mutations that occur during replication. 

However, such progeny are included when the term "host cell" is used. Methods 
of stable transfer, meaning that the foreign DNA is continuously maintained in the 
host, are known in the art. 

In the present invention, the HIF-1a nucleotide sequences may be inserted 

10 into a recombinant expression vector. The term "recombinant expression vector" 
refers to a plasmid, virus or other vehicle known in the art that has been 
manipulated by insertion or incorporation of the HIF-1cc genetic sequences. Such 
expression vectors contain a promoter sequence which facilitates the efficient 
transcription in the host of the inserted genetic sequence. The expression vector 

15 typically contains an origin of replication, a promoter, as well as specific genes 

which allow phenotypic selection of the transformed cells. Vectors suitable for 
use in the present invention include, but are not limited to the T7-based 
expression vector for expression in bacteria (Rosenberg et al. (1987) Gene 
56:125), the pMSXND expression vector for expression in mammalian cells (Lee 

20 and Nathans (1988) J. Biol. Chem. 263:3521) and baculovirus-derived vectors for 

expression in insect cells. The DNA segment can be present in the vector 
operably linked to regulatory elements, for example, a promoter (e.g., T7, 
metallothionein I, or polyhedron promoters). 

Nucleotide sequences encoding HIF-1a can be expressed in either 

25 prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and 

mammalian organisms. Methods of expressing DNA sequences having eukaryotic 
or viral sequences in prokaryotes are well known in the art. Biologically functional 
viral and plasmid DNA vectors capable of expression and replication in a host are 
known in the art. Such vectors are used to incorporate DNA sequences of the 

30 invention. 

Transformation of a host cell with recombinant DNA may be carried out by 
conventional techniques as are well known to those skilled in the art. Where the 
host is prokaryotic, such as E. co//, competent cells which are capable of DNA 
uptake can be prepared from cells harvested after exponential growth phase and 
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subsequently treated by the CaCI 2 method using procedures well known in the art. 
Alternatively, MgCI 2 or RbCI can be used. Transformation can also be performed 
after forming a protoplast of the host cell if desired. 

When the host is a eukaryote, such methods of transfection of DNA as 
5 calcium phosphate co-precipitates, conventional mechanical procedures such as 

microinjection, electroporation, insertion of a plasmid encased in liposomes, or 
virus vectors may be used. Eukaryotic cells can also be cotransformed with DNA 
sequences encoding the HIF-1a of the invention, and a second foreign DNA 
molecule encoding a selectable phenotype, such as the herpes simplex thymidine 

10 kinase gene. Another method is to use a eukaryotic viral vector, such as simian 

virus 40 (SV40) or bovine papilloma virus, to transiently infect or transform 
eukaryotic cells and express the protein (see, for example, Eukaryotic Viral 
Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982). 

Isolation and purification of microbial expressed polypeptide, or fragments 

15 thereof, provided by the invention, may be carried out by conventional means 

including preparative chromatography and immunological separations involving 
monoclonal or polyclonal antibodies. 

The HIF-1a polypeptides of the invention can also be used to produce 
antibodies which are immunoreactive or bind to epitopes of the HIF-1a 

20 polypeptides. Such antibodies can be used, for example, in standard affinity 

purification techniques to isolate HIF-1a or HIF-1 . Antibody which consists 
essentially of pooled monoclonal antibodies with different epitopic specificities, as 
well as distinct monoclonal antibody preparations are provided. Monoclonal 
antibodies are made from antigen containing fragments of the protein by methods 

25 well known in the art (Kohler et al. (1975) Nature 256:495; Current Protocols in 

Molecular Biology, Ausubel et al., ed., 1989). 

For purposes of the invention, an antibody or nucleic acid probe specific for 
HIF-1 a may be used to detect HIF-1 a polypeptide (using antibody) or nucleotide 
sequences (using nucleic acid probe) in biological fluids or tissues. The antibody 

30 reactive with HIF-1a or the nucleic acid probe is preferably labeled with a 

compound which allows detection of binding to HIF-1 a. Any specimen containing 
a detectable amount of antigen or polynucleotide can be used. Various detectable 
labels and assay formats are well known to those of ordinary skill in the art and 
can be utilized without resort to undue experimentation. 
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When the cell component is nucleic acid, it may be necessary to amplify the 
nucleic acid prior to binding with an HIF-1 a specific probe. Preferably, polymerase 
chain reaction (PGR) is used, however, other nucleic acid amplification 
procedures such as ligase chain reaction (LCR), ligated activated transcription 
5 (LAT) and nucleic acid sequence-based amplification (NASBA) may be used. 

The present invention provides a HIF-1a variant polypeptide characterized as 
dimerizing with HIF-1 (J to form a functionally inactive HIF-1 complex in that the 
complex is not able to sufficiently bind to the HIF-1 binding motif in the regulatory 
region to allow efficient expression of the structural gene under control of the 

10 regulatory region. The invention further provides nucleotide sequences encoding 
HIF-1 a variants. In one specific embodiment, the polynucleotide encoding HIF- 
1a variant is provided having the polynucleotide sequence of SEQ ID NO:3. The 
HIF-1a variant polypeptide SEQ ID NO:4 is generated by substitution of wild-type 
amino acids with different amino acids and by deleting a portion of the wild-type 

15 sequence. Modifications of the HIF-1 a variant amino acid sequence are 

encompassed by the invention so long as the resulting polypeptide dimerizes to 
HIF-1 p to form a functionally inactive HIF-1 complex in the sense that the HIF-1 
complex or dimer no longer sufficiently binds DNA. In a preferred embodiment of 
the invention, specific HIF-1 a variants are provided wherein one or more the 

20 amino acids that participate in the binding of HIF-1 to DNA are replaced using 

techniques of genetic engineering. 

The specific dominant-negative variant forms of HIF-1 a are HIF-1 ccANB and 
HIF-1aANBAAB (see Example 10). These two forms have in common a deletion 
of the amino acids that comprise the basic domain required for DNA binding (HIF- 

25 1a amino acid residues 17-30; Fig. 10). Any variant form of HIF-1 a in which 

modification of the basic domain eliminates DNA binding activity while maintaining 
the ability of HIF-1 a to dimerize with HIF-1 p should function as a dominant 
negative variant. Such alterations of the nucleotide sequence encoding the basic 
domain include deletions or substitutions of critical basic amino acid residues 

30 within the domain that are required for DNA binding. Additional modifications of 

the protein may enhance the dominant negative effect in vivo. For example, the 
HIF-1 aANBAAB variant contains the same mutation in the basic domain as HIF- 
1aANB (Fig. 16) but, in addition, HIF-1 aANBAAB is also truncated at the carboxy 
terminus to improve its protein stability in vivo. 
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The nucleotide sequences encoding HIF-1a variant molecules of the 
invention can be inserted into an appropriate expression vector and expressed in 
cells. Modified versions of the specific HIF-1 a variant of SEQ ID NO:4 can be 
engineered to enhance stability, production, purification, or yield of the expressed 
5 product. For example, the expression of a fusion protein or a cleavable fusion 

protein comprising the HIF-1 a variant and a heterologous protein can be 
engineered. Such a fusion protein can be readily isolated by affinity 
chromatography, e.g., by immobilization on a column specific for the heterologous 
protein. Where a cleavage site is engineered between the HIF-1cc moiety and the 

10 heterologous protein, the HIF-1 a polypeptide can be released from the 

chromatographic column by treatment with an appropriate enzyme or agent that 
disrupts the cleavage site (Booth et al. (1988) Immunol. Lett. 19:65-708; Gardella 
et al. (1990) J. Biol. Chem. 265:15854-15859). 

The invention provides methods for treatment of HIF-1 -mediated disorders, 

15 including hypoxia-mediated tissue damage, which are improved or ameliorated by 

modulation of HIF-1 gene expression or activity. The term "modulate" envisions 
the inhibition of expression of HIF-1 when desirable, or enhancement of HIF-1 
expression when appropriate. Where expression or enhancement of expression 
of HIF-1 is desirable, the method of the treatment includes direct (protein) or 

20 indirect (nucleotide) administration of HIF-1 . 

According to the method of the invention, substantially purified HIF-1 or the 
nucleotide sequence encoding HIF-1 is introduced into a human patient for the 
treatment or prevention of HIF-1 -mediated disorders. The appropriate human 
patient is a subject suffering from a HIF-1 -mediated disorder or a hypoxia-related 

25 disorder, such as atherosclerotic coronary or cerebral artery disease. When a 

patient is treated with nucleotide, the nucleotide can be a sequence which 
encodes HIF-1 a or a nucleotide sequence which encodes HIF-1 a and a 
nucleotide sequence which encodes HIF-1 (J (see, for example, Rayes, et a/., 
Science, 256:1193-1195, 1992; and Hoffman, et al, Science, 252:954-958, 

30 1991). 

Where inhibition of HIF-1 a expression is desirable, such as the inhibition of 
tumor proliferation mediated by VEGF-induced angiogenesis, inhibitory nucleic 
acid sequences that interfere with HIF-1 expression at the translational level can 
be used. This approach utilizes, for example, antisense nucleic acid, ribozymes, 
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or triplex agents to block transcription or translation of a specific HIF-1a mRNA or 
DNA, either by masking that mRNA with an antisense nucleic acid or DNA with a 
triplex agent, or by cleaving the nucleotide sequence with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary 
5 to at least a portion of a specific mRNA molecule (Weintraub (1990) Scientific 

American 262:40). In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA, forming a double-stranded molecule. The antisense 
nucleic acids interfere with the translation of the mRNA, since the cell will not 
translate a mRNA that is double-stranded. Antisense oligomers of about 15 
10 nucleotides are preferred, since they are easily synthesized and are less likely to 

cause problems than larger molecules when introduced into the target HIF- 
1a-producing cell. 

Use of an oligonucleotide to stall transcription is known as the triplex strategy 
since the oligomer winds around double-helical DNA, forming a three-strand helix. 

15 Therefore, these triplex compounds can be designed to recognize a unique site 

on a chosen gene (Maher et al. (1991) Antisense Res. and Dev. 1:227; Helene 
(1991) Anticancer Drug Design, 6:569). 

Ribozymes are RNA molecules possessing the ability to specifically cleave 
other single stranded RNA in a manner analogous to DNA restriction 

20 endonucleases. Through the modification of nucleotide sequences which encode 

these RNAs, it is possible to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech (1988) J. Amer. 
Med. Assn. 260:3030). A major advantage of this approach is that, because they 
are sequence-specific, only mRNAs with particular sequences are inactivated. 

25 There are two basic types of ribozymes namely, tetrahymena-type 

(Hasselhoff (1988) Nature 334:585) and "hammerhead"-type. Tetrahymena-type 
ribozymes recognize sequences which are four bases in length, while 
M hammerhead M -type ribozymes recognize base sequences 11-18 bases in 
length. The longer the recognition sequence, the greater the likelihood that the 

30 sequence will occur exclusively in the target mRNA species. Consequently, 

hammerhead-type ribozymes are preferable to tetrahymena-type ribozymes for 
inactivating a specific mRNA species and 18-based recognition sequences are 
preferable to shorter recognition sequences. 
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Suppression of HIF-1 function can also be achieved through administration of 
HIF-1a variant polypeptide (dominant negative variant form), or a nucleotide 
sequence encoding HIF-1 a variant polypeptide. For example, in the case of 
disorders enhanced by expression of HIF-1 a, such as tumor proliferation 
secondary to VEGF-mediated angiogenesis, it would be desirable to "starve" the 
tumor by inhibiting neovascularization necessary to supply sufficient nutrients to 
the tumor. By administering HIF-1 a variant polypeptide or a nucleotide sequence 
encoding such polypeptide, the variant will compete with wild-type HIF-1 a for 
binding to HIF-1 p in forming HIF-1 dimer thereby lowering the concentration of 
HIF-1 dimer in the cell which can efficiently bind to the HIF-1 DNA binding motif. 

The present invention also provides gene therapy for the treatment of 
hypoxia-related disorders, which are improved or ameliorated by the HIF-1 
polypeptide. Such therapy would achieve its therapeutic effect by introduction of 
the HIF-1 a nucleotide, alone or in combination with HIF-1 p nucleotide, into cells 
exposed to hypoxic conditions. Delivery of HIF-1a nucleotide, alone or in 
combination with HIF-P nucleotide, can be achieved using a recombinant 
expression vector such as a chimeric virus or a colloidal dispersion system. 
Especially preferred for therapeutic delivery of sequences is the use of targeted 
liposomes. 

Various viral vectors which can be utilized for gene therapy as taught herein 
include adenovirus, adeno-associated virus, herpes virus, vaccinia, or, preferably, 
an RNA virus such as a retrovirus. Preferably, the retroviral vector is a derivative 
of a murine or avian retrovirus. Examples of retroviral vectors in which a single 
foreign gene can be inserted include, but are not limited to: Moloney murine 
leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine 
mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). Preferably, 
when the subject is a human, a vector such as the gibbon ape leukemia virus 
(GaLV) is utilized. A number of additional retroviral vectors can incorporate 
multiple genes. All of these vectors can transfer or incorporate a gene for a 
selectable marker so that transduced cells can be identified and generated. By 
inserting a HIF-1 a sequence of interest into the viral vector, along with another 
gene which encodes the ligand for a receptor on a specific target cell, for 
example, the vector is now target specific. Retroviral vectors can be made target 
specific by attaching, for example, a sugar, a glycolipid, or a protein. Preferred 
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targeting is accomplished by using an antibody to target the retroviral vector. 
Those of skill in the art will know of, or can readily ascertain without undue 
experimentation, specific polynucleotide sequences which can be inserted into the 
retroviral genome or attached to a viral envelope to allow target specific delivery 
5 of the retroviral vector containing the HIF-1a nucleotide sequence. 

Since recombinant retroviruses are defective, they require assistance in order 
to produce infectious vector particles. This assistance can be provided, for 
example, by using helper cell lines that contain piasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences within 

10 the LTR. These piasmids are missing a nucleotide sequence which enables the 
packaging mechanism to recognize an RNA transcript for encapsidation. Helper 
cell lines which have deletions of the packaging signal include, but are not limited 
to V2, PA317 and PA12, for example. These cell lines produce empty virions, 
since no genome is packaged. If a retroviral vector is introduced into such cells in 

15 which the packaging signal is intact, but the structural genes are replaced by 

other genes of interest, the vector can be packaged and vector virion produced. 

Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected 
with piasmids encoding the retroviral structural genes gag, pol and env, by 
conventional calcium phosphate transfection. These cells are then transfected 

20 with the vector plasmid containing the genes of interest. The resulting cells 
release the retroviral vector into the culture medium. 

Another targeted delivery system for HIF-1a nucleotides is a colloidal 
dispersion system. Colloidal dispersion systems include macromolecule 
complexes, nanocapsules, microspheres, beads, and lipid-based systems 

25 including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 

preferred colloidal system of this invention is a liposome. Liposomes are artificial 
membrane vesicles which are useful as delivery vehicles in vitro and in vivo. It 
has been shown that large unilamellar vesicles (LW), which range in size from 
0.2-4.0 um can encapsulate a substantial percentage of an aqueous buffer 

30 containing large macromolecules. RNA, DNA and intact virions can be 

encapsulated within the aqueous interior and be delivered to cells in a biologically 
active form (Fraley, et al. (1981) Trends Biochem. Sci. 6:77). In addition to 
mammalian cells, liposomes have been used for delivery of polynucleotides in 
plant, yeast and bacterial cells. In order for a liposome to be an efficient gene 
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transfer vehicle, the following characteristics should be present: (1) encapsulation 
of the genes of interest at high efficiency while not compromising their biological 
activity; (2) preferential and substantial binding to a target cell in comparison to 
non-target cells; (3) delivery of the aqueous contents of the vesicle to the target 
cell cytoplasm at high efficiency; and (4) accurate and effective expression of 
genetic information (Mannino et aL (1988) Biotechniques 6:682). 

The composition of the liposome is usually a combination of phospholipids, 
particularly high-phase-transition-temperature phospholipids, usually in 
combination with sterols, especially cholesterol. Other phospholipids or other 
lipids may also be used. The physical characteristics of liposomes depend on pH, 
ionic strength, and the presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl 
compounds, such as phosphatidyl-glycerol, phosphatidylcholine, 
phosphatidylserine, phosphatidylethanolamine, sphingolipids, cerebrosides, and 
gangliosides. Particularly useful are diacylphosphatidyl-glycerols, where the lipid 
moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon atoms, 
and is saturated. Illustrative phospholipids include egg phosphatidylcholine, 
dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine. 

The targeting of liposomes can be classified based on anatomical and 
mechanistic factors. Anatomical classification is based on the level of selectivity, 
for example, organ-specific, cell-specific, and organelle-specific. Mechanistic 
targeting can be distinguished based upon whether it is passive or active. Passive 
targeting utilizes the natural tendency of liposomes to distribute to cells of the 
reticuloendothelial system (RES) in organs which contain sinusoidal capillaries. 
Active targeting, on the other hand, involves alteration of the liposome by coupling 
the liposome to a specific ligand such as a monoclonal antibody, sugar, glycolipid, 
or protein, or by changing the composition or size of the liposome in order to 
achieve targeting to organs and cell types other than the naturally occurring sites 
of localization. 

The surface of the targeted delivery system may be modified in a variety of 
ways. In the case of a liposomal targeted delivery system, lipid groups can be 
incorporated into the lipid bilayer of the liposome in order to maintain the targeting 
ligand in stable association with the liposomal bilayer. Various linking groups can 
be used for joining the lipid chains to the targeting ligand. 
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Due to the biological activity of HIF-1 in enhancing synthesis of VEGF, EPO, 
and glycolytic enzymes, there are a variety of applications using the polypeptide 
or nucleotide of the invention. Such applications include treatment of hypoxia- 
related tissue damage and HIF-1 -mediated disorders, In addition, HIF-1 may be 
5 useful in various gene therapy procedures. HIF-1 can be used to prevent or 

repair hypoxia-mediated tissue damage. Important applications include the 
treatment of cerebral and coronary artery disease. 

Conversely, blocking HIF-1 action either with anti-HIF-l antibodies, anti-HIF- 
1a antibodies, or with an HIF-1 a antisense nucleotide might slow or ameliorate 

10 diseases dependent on HIF-1 action, e.g., V-EGF-promoted tumor 

vascularization. The above described method for delivering an HIF-1 a nucleotide 
are fully applicable to delivery of an HIF-1 antagonist for specific blocking of HIF-1 
expression and/or activity when desirable. An HIF-1 antagonist can be an HIF-1 
antibody, an HIF-1a antibody, an HIF-1a antisense nucleotide sequence, or the 

15 polypeptide or nucleotide of an HIF-1 a variant. 

The isolation and purification of HIF-1 from EPO-producing Hep3B cells and 
non-EPO-producing HeLa S3 cells is described in Examples 1-3. HIF-1 protein 
was purified 1 1 ,250-fold by DEAE ion-exchange and DNA affinity 
chromatography. Analysis of HIF-1 revealed 4 polypeptides having molecular 

20 weights of 91, 93, 94 (HIF-1 JJ) and 120 kDa (HIF-1 a). Glycerol gradient 

sedimentation analysis indicates that HIF-1 exists predominantly as a heterodimer 
and to a lesser extent as a heterotetramer. 

The HIF-1 a polypeptide was isolated and sequenced. Its cDNA was 
generated by PCR and its sequence determined. The HIF-1 a polypeptide is 

25 characterized as a basic-helix-loop-helix (bHLH) polypeptide containing a PAS 

domain whose expression is regulated by cellular O z tension (Examples 4-7). 

Induction of the transcription of genes encoding the glycolytic enzymes by 
HIF-1 was investigated (Example 9). The studies revealed that the glycolytic 
enzymes aldolase A (ALDA), phosphoglycerate kinase 1 (PGK1), and pyruvate 

30 kinase M (PKM) are induced by exposure of cells to HIF-1 inducers (1% 0 2 , 

CoCI 2 , DFX). These genes have HIF-1 binding sites which were shown to 
specifically bind HIF-1 . These results support the role of HIF-1 as a mediator of 
adaptive responses to hypoxia that underlie cellular and systemic oxygen 
homeostasis. 
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A dominant-negative variant of HIF-1 a was generated lacking the basic 
domain (amino acid 17-30) of the protein which is required for the binding of HIF-1 
to DNA (Example 10). The variant H1F-1cc subunit can dimerize with HIF-1 3, but 
the resulting heterodimer cannot bind DNA. In cells overexpressing the variant 
5 HIF-1 a subunit, the majority of the HIF-1 p subunits were engaged in non- 

functional heterodimers, resulting in functional inactivation of HIF-1. These 
results show that the HIF-1 a variant is useful in vivo for blocking HIF-1 activity. 

The following examples are intended to illustrate but not limit the invention. 
While they are typical of those that might be used, other procedures known to 
10 those skilled in the art may alternatively be used. 



Example 1. Experimental Methods . 

Human HIF-1 was purified, and its DNA binding activity characterized as 
follows. 

Cell Culture and Nuclear Extract Preparation . Human Hep3B ant HeLa 

15 cells were maintained and treated with 1% O z and CoCI 2 (Wang & Semenza 

(1993a) Proc. Natl. Acad. Sci. USA 90:4304-4308), and nuclear extracts were 
prepared as described previously (Semenza & Wang (1992) Mol. Cell. Biol. 
12:5447-5454; Dignam et al. (1983) Nucleic Acids Res. 11:1474-1489). HeLa S3 
cells, obtained from American Type Culture Collection were adapted to 

20 suspension growth in Spinner's minimum essential medium supplemented with 

5% (v/v) horse serum (Quality Biological, Gaithersburg, MD). The cells were 
grown to a density of 8 x 10 s cells/ml and maintained by dilution to 2 x 10 s cells/ml 
with fresh complete medium every 2 days. For induction of HIF-1 DNA binding 
activity, HeLa S3 cells were treated with 125 uM CoCI 2 for 4 h at 37 ©c before 

25 harvesting by centrifugation for 10 min at 2,500 x g. Cell pellets were washed 

twice with ice cold phosphate-buffered saline and resuspended in 5 packed cell 
volumes of buffer A (1 0 mM Tris-HCI (pH 7.6), 1 .5 mM MgCI 2 , 1 0 mM KCI) 
supplemented with 2 mM dithiothreitol (DTT), 0.4 mM phenylmethylsulfonyl 
fluoride and 1 mM Na 3 V0 4 . After incubation on ice for 10 min, cells were pelleted 

30 at 2,500 x g for 5 min, resuspended in 2 packed cell volumes of buffer A, and 

lysed by 20 strokes in a glass Dounce homogenizer with type B pestle. Nuclei 
were pelleted at 10,000 x g for 10 min and resuspended in 3.5 packed nuclear 
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volumes of buffer C (0.42 M KCI f 20 mM Tris-HCI (pH 7.6), 20% glycerol, 1.5 mM 
MgCI 2 ) supplemented with 2 mM DTT, 0.4 mM phenylmethylsulfonyl fluoride, and 
1 mM Na 3 V0 4 . Nuclear proteins were extracted by stirring at 4oC for 30 min. 
After centrifugation at 15,000 x g for 30 min, the supernatant was dialyzed against 
5 buffer Z-100 (25 mM Tris-HCI (pH 7.6), 0.2 mM EDTA, 20% glycerol, 2 mM DTT, 

0.4 mM phenylmethylsulfonyl fluoride, 1 mM Na 3 V0 4 , and 100 mM KCI) at 4oC. 
The dialysate was clarified by ultracentrifugation at 100,000 x g for 60 min at 4oC, 
and designated as crude nuclear extract. The nuclear extracts were aliquoted, 
frozen in liquid N 2 , and stored at -80oC. Protein concentration was determined by 

10 the method of Bradford (1976) Anal. Biochem. 72:248-254, with a commercial kit 

(Bio-Rad) using bovine serum albumin (BSA) as a standard. 

Gel shift assays . Gel shift assays were performed as described (Semenza & 
Wang (1992) Mol. Cell. Biol. 12:5447-5454, herein specifically incorporated by 
reference) except that the binding reaction was in buffer Z-100. For gel shift 

15 assays with partially purified and affinity-purified HIF-1 preparations, 0.25 mg/ml 

of BSA and 0.05% Nonidet P-40 were included in the binding reaction. 
Nonspecific competitor calf thymus DNA (Sigma) was used in reduced amounts 
for partially purified fractions, and no calf thymus DNA was used for affinity- 
purified HIF-1 fractions. For competition experiments, unlabeled oligonucleotide 

20 DNA was incubated with DEAE-Sepharose column fractions for 5 min on ice 

before probe DNA was added. 

Nuclear extracts prepared from HeLa cells cultured in the presence of 0, 5, 
10, 25, 50, 75, 100, 250, 500 or 1000 uM CoCI 2 for 4 h at 37oC, were incubated 
with W18 probe. 

25 Methvlation interference analysis . Methylation interference analysis was 

performed as described (Wang & Semenza (1993b) J. Biol. Chem. 268:21513- 
21518, herein specifically incorporated by reference), except 100 ug of nuclear 
extract prepared from CoCI 2 -treated HeLa cells were used in the binding 
reactions. 

30 Results . To determine the optimal concentration of CoCI 2 for induction of 

HIF-1 DNA binding activity, HeLa cells were treated with CoCI 2 . Nuclear extracts 
were prepared and analyzed by gel shift assay with the wild-type oligonucleotide 
W18 (Example 2) as probe. Results are shown in Fig. 1 . Induction of HIF-1 DNA 
binding activity by CoCI 2 was dose-dependent. HIF-1 activity in nuclear extracts 
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was detected at 25 uM CoCI 2 and reached a peak activity at 250 uM. Significant 
cell death, however, was observed at CoCI 2 concentrations greater than 250 uM, 
resulting in decreased yield of nuclear proteins. For this reason 125 uM CoCI 2 
was chosen for subsequent large scale nuclear extract preparation. Constitutive 
5 DNA binding activities, which also bind W18 probe sequence specifically 

remained relatively unchanged in cells treated with 0-100 uM CoCI 2l and 
decreased at CoCI 2 concentration greater than 250 uM, suggesting an adverse 
effect of high CoCI 2 concentration on the cells. Nonspecific DNA binding activities 
were barely detectable in this particular gel shift assay and vary with cell type and 

10 the relative amount of nonspecific competitor DNA used. 

Methylation interference analysis was performed to determine if HIF-1 from 
hypoxic Hep3B cells and CoCI 2 .treated HeLa cells has the same DNA binding 
properties. As shown in Fig. 2, methylation of G 8 or G 10 on the coding strand 
eliminated or greatly reduced HIF-1 binding, respectively (Fig. 2, left, lane 2). 

15 Methylation of G 10 only partially interfered with the binding of constitutive factors 

(Fig. 2, left, lanes 3 and 4). On the noncoding strand, methylation of G 7 or G„ 
blocked HIF-I binding to the probe (Fig. 2B, right, lane 2). Only the methylation of 
G 7 interfered with binding of constitutive factors (Fig. 2B, right, lanes 3 and 4). 
The nonspecific binding activity was unaffected by DNA methylation on either 

20 strand (Fig. 2A, left, lane 5 and Fig. 2B t right, lane 5). The results indicate that (i) 

HIF-1 closely contacts G 8 and G 10 on the coding strand and G 7 and G„ on the 
noncoding strand through the major groove of the DNA helix, and (ii) HIF-1 and 
the constitutive DNA binding factors can be distinguished by the nature of their 
DNA binding site contacts. 
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Example 2. Biochemical Purification of HIF-1 . 

Preparation of DNA affinity columns . DNA affinity columns were prepared by 
coupling multimerized double-stranded oligonucleotides to CNBr-activated 
Sepharose (Kadonaga & Tijan (1986) Proc. Natl. Acad. Sci. USA 83:5889-5893). 
5 The wild-type and the mutant column contained multimerized oligonucleotide W18 

(SEQ ID NO:5) 

and M18 (SEQ ID NO:6) (mutation underlined), respectively. 



W1 8: 5 , -gatcGCCCTACGTGCTGTCTCA-3 , 
3-CGGGATGCACGACAGAGTctag-5' 

10 M 1 8: 5 l -gatcGCCCTAAAAGCTGTCTCA-3 t 

3'-CGGGATTTTCGACAGAGTctag-5' 



Equal amounts of complementary oligonucleotides were annealed, 
phosphorylated, and ligated. Ligated oligonucleotides (60-500 bp) were extracted 
with phenol/chloroform, ethanol precipitated, resuspended in deionized water, and 

15 coupled to CNBr-activated Sepharose 4B as instructed by the manufacturer 

(Pharmacia Biotech Inc.). Approximately 50 ug of ligated double-stranded 
oligonucleotides were coupled per ml of Sepharose. 

Purification of HIF-1 . Crude nuclear extracts from 120 liters of CoCI 2 -treated 
HeLa S3 cells (435 ml, 3,040 mg) were thawed on ice and clarified by 

20 centrifugation at 15,000 x g for 10 min. Extracts were fractionated as three 

batches over a 36 ml DEAE-Sepharose CL-6B column (Pharmacia) in buffer Z- 
100 with a step gradient of increasing KCI. Fractions containing peak activity 
were pooled and dialyzed against buffer Z-100. The dialysate from DEAE- 
Sepharose columns was incubated with calf thymus DNA (Sigma) at a 

25 concentration of 4.4 ug/ml for 15 min on ice. After centrifugation at 15,000 x g for 

10 min, the supernatant (240 ml; 2.3 mg/ml) was applied to a 6 ml DNA affinity 
column prepared with concatenated W18 oligonucleotide. The fractions 
containing HIF-1 activity were pooled and dialyzed against buffer Z-100. The 
dialysate from the first DNA-affinity column was mixed with calf thymus DNA at a 

30 concentration of 2.5 ug/ml and incubated on ice for 15 min. After centrifugation 

(as described above), the supernatant was applied to a 1.5 ml M18 DNA- 
Sepharose column. The flowthrough from the M18 column was collected and 
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reapplied to a second 2 ml W18 column. All buffers used for DNA affinity 
chromatography were supplemented with 0.05% Nonidet P-40 and 5 mM DTT. 
The amount of protein in affinity column fractions was quantitated by silver 
staining of SDS-polyacrylamide gels or by Amido Black (Sigma) staining of 
nitrocellulose membranes (Schleicher & Schuell) spotted with protein samples 
and compared against known amounts of proteins standards (Bio-Rad). 

For purification of HIF-1 from hypoxia-treated Hep3B cells, nuclear extracts 
(95 mg) were fractionated by the use of a 4 ml DEAE-Sepharose CL-6B column 
as described above. 0.25 M KCI elute fractions were dialyzed against buffer 2- 
100 and applied onto a Sephacryl S-300 gel filtration column (50 ml, 1.5 x 30 cm). 
The fractions containing HIF-1 activity were pooled an applied to a 2 ml calf 
thymus DNA column (0.8 mg of calf thymus DNA/ml of Sepharose) prepared by 
coupling single-stranded calf thymus DNA to CNBr-activated Sepharose 4B. The 
flowthrough was collected and applied to a 0.4 ml W18 column as described 
above after incubation with calf thymus DNA (2.2 ug/ml) for 10 min followed by 
another 0.2 ml W18 column after dialysis against buffer Z-1 00. 

SDS-PAGE and Silver Staininn SDS-PAGE was carried out as described by 
Laemmli (1970) Nature 227:680-685. The gels were calibrated with high range 
molecular weight standards or prestained molecular weight markers (Bio-Rad). 
Electrophoresis was performed at 30 mA. Silver staining was performed with 
silver nitrate as described (Switzer et al. (1979) Anal. Biochem. 98:231-237). 
Molecular weight estimation for HIF-1 polypeptides was based on SDS- 
polyacrylamide gels with 3.2% cross-linking (acrylamide/bisacrylamide ration of 
30:1). 

Results - Since HIF-1 DNA binding activity from hypoxic Hep3B cells 
and CoCI 2 -treated HeLa cells are indistinguishable (Example 1), HeLa S3 cells 
treated with 125 uM CoCI 2 were used as starting material for the large scale 
purification of HIF-I. To purify HIF-1 by DNA affinity chromatography, the 
constitutive DNA binding activity had to first be separated from HIF-I since both 
bind specifically to the W18 DNA sequence. Various ion-exchange resins and gel 
filtration matrices were examined. HIF-1 was retained on DEAE anion-change 
resins in buffer Z-1 00, whereas constitutive DNA binding activity was found in the 
flowthrough. HIF-1 DNA binding activity was eluted with 250 mM KCI in buffer Z. 
DEAE-Sepharose chromatography effectively removed constitutive DNA binding 
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activity and resulted in a 4-foJd purification of HIF-1 (Fig. 3A, lanes 1 and 2). This 
step, however, appeared to destabilize the HIF-1 protein complex and resulted in 
a faster migrating form of HIF-1 (Fig. 3A, lane 2, second arrow), which was also 
occasionally seen in crude nuclear extract preparations. This faster migrating 
5 form could be converted to the slower migrating HIF-1 band at higher salt 

concentrations, and HIF-I appeared predominantly as the slower migrating form 
again after the first round of DNA affinity column chromatography (Fig. 3A, lanes 
10-12), suggesting that no HIF-1 component was lost during the DEAE- 
Sepharose chromatography step. Probe binding of both HIF-1 forms could be 

10 competed by unlabeled W18 (Fig. 3B, lanes 2-4) but not M18 oligonucleotide (Fig. 

3B, lanes 5-7), which contained a three-base pair substitution that abolished the 
ability of the EPO enhancer to mediate hypoxia-inducible transcription. 

Partially purified HIF-1 fractions were then incubated with nonspecific 
competitor calf thymus DNA at concentrations that allowed optimal detection of 

15 HIF-1 DNA binding activity by gel shift assays and applied to a W18 DNA affinity 

column. Eluted fractions containing HIF-I (0.5 M KCI, Fig. 3A, lane 10; 1 M KCI, 
Fig. 3A, lane 11) were pooled and dialyzed against buffer Z-1 00. To eliminate 
nonspecific DNA-binding proteins that were not removed by calf thymus DNA 
competitor, the dialysate was applied to an M18 DNA column. HIF-I DNA binding 

20 activity was detected in the flowthrough, which was then applied directly onto 

second W18 column. HIF-I activity was detected exclusively in 0.5 M KCI 
fractions. Two rounds of W18 and one round of M18 column chromatography 
resulted in a purification of approximately 2,800-fold. 

The results of the final large scale purification are summarized in Table 1. 

25 From 120 liters of HeLa cells, approximately 60 u g of highly purified HIF-1 were 

obtained. The total purification was 11,250-fold and yielded approximately 22% of 
the starting of HIF-1 DNA binding activity. Our objective was to identify HIF-1 
subunits and isolate HIF-1 components for the purpose of peptide mapping and 
protein microsequencing analysis. Since additional steps of purification resulted 

30 in markedly lower yield, we did not purify HIF-1 further to homogeneity. Aliquots 

from flowthrough of the M18 column (Fig. 4A, Load) as well as the 0.25 M KCI 
wash and 0.5 M KCI elute fractions of the second W18 column were analyzed by 
6% SDS-PAGE and silver staining. Four polypeptides of 90-120 kDa were highly 
enriched in the 0.5 M KCI fraction, which had high HIF-1 DNA binding activity 
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compared with the 0.25 M KCI fraction, which had very little HIF-I activity. The 0.5 
M KCI fraction, however, still had many of the contaminant proteins found in the 
0.25 M KCI fraction. 

In an initial pilot purification of HIF-1 from hypoxia-induced Hep3B cells, a 
different purification protocol was used. Gel filtration over a Sephacryl S-300 
column was also found to be effective in separating HIF-1 from constitutive DNA 
binding activity. In addition, a calf thymus DNA column was used to remove 
nonspecific DNA-binding proteins prior to two rounds of W1 8 DNA affinity 
chromatography. HIF-I activity was detected in 0.5 M KCI fractions from both 
DNA affinity columns. An aliquot from the 0.5 M KCI elute fraction of the first W18 
column (Fig. 4B, Load) as well as the 0.25 M KCI wash and 0.5 M KCI elute 
fractions of the second W18 column were analyzed by 7% SDS-PAGE and silver 
staining. Four polypeptides of similar molecular mass to those that co-purified 
with HIF-1 DNA binding activity in CoCI 2 -treated HeLa cells were present in the 
affinity-purified preparation from hypoxic Hep3B cells (Fig. 4B, lane 3, arrows), 
indicating that HIF-1 from the two different cell types is composed of the same 
polypeptide subunits. Affinity-purified HIF-1 from both CoCI 2 -treated HeLa cells 
and hypoxic Hep3B cells bound specifically to the W18 probe in gel shift assays. 
Example 3. Analysis of HIF-1 Subunits 

The following experiments were conducted to identify polypeptides that are 
part of the HIF-1 DNA binding complex. 

Preparative gel shift assays were performed with 30 ul of affinity-purified HIF- 
1 and probe W18. Gel slices containing HIF-1 and surrounding areas were 
isolated after autoradiography with wet gel. Gel slices were placed on the 
stacking gel of a 6% SDS-polyacrylamide gel and incubated with Laemmli buffer 
in situ for 15 min, and electrophoresis was performed in parallel with 30 ul of 
affinity-purified HIF-1 and molecular weight markers. For two-dimensional 
denaturing gel electrophoresis, two aliquots of affinity-purified HIF-1 were 
resolved on a 6% SDS-polyacrylamide gel with 5% cross-linking 
(acrylamide/bisacrylamide ratio of 19:1). One lane was stained with silver nitrate. 
The gel slices corresponding to regions of interest were isolated from the 
unstained lane. The isolated gel slices were placed directly on the stacking gel of 
the second dimension 6% SDS-polyacrylamide gel with 3.2% cross-linking, and 
electrophoresis was performed in parallel with 30 ul of affinity purified HIF-1 
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Peptide Mapping of H1F-1 Subunrts . 2 ml of the affinity-purified HIF-1 were 
dialyzed against 10 mM ammonium bicarbonate, 0.05% SDS and lyophilized. 
After resuspension in a solubilizing solution (100 mM sucrose, 3% SDS, 21.25 
mM Tris-HCI (pH 6.9), 1 mM EDTA, 5% p-mercaptoethanol, 0.005% bromphenol 
5 blue), the protein samples were heated to 37©C for 15 min and resolved on a 6% 

polyacrylamide gel containing 0.2% SDS. Polypeptides were transferred 
electrophoretically at 4oC to a polyvinylidene difluoride membrane (Bio-Rad) in 
0.5 x Towbin buffer (Towbin et al. 91979) Proc. Natl. Acad. Sci. USA 76:4350- 
5354) (96 mM glycine, 12.5 mM Tris-HCI (pH 8.3)) with 10% acetic acid, 

10 destained with 5% acetic acid and rinsed with Milli-Q water. Membrane slices 
containing the HIF-1 polypeptides of 120, 94/93, and 91 kDa were excised and 
subjected to peptide mapping (Best et al. (1994) in Techniques in Protein 
Chemistry V (Crabb, J.W., ed.), pp. 205-213, Academic Press, San Diego, CA). 
In situ tryptic digestion and reverse phase HPLC were performed by the Wistar 

15 Protein Microchemistry Laboratory. 

UV Cross-Linking Analysis . UV cross-linking was carried out as described 
(Wang & Semenza (1993) Proc. Natl. Acad. Sci. USA 90:4304-4308) except that 
30 ul of affinity-purified HIF-1 were used in the binding reaction. Affinity-purified 
HIF-1 was incubated with W18 probe in the absence or presence of unlabeled 

20 W18 or M1 8 oligonucleotide. After incubation for 15 min at 4oC f the reaction 

mixtures were irradiated with UV light (312 nm; Fisher Scientific) for 30 min and 
resolved by 6% SDS-PAGE with pre-stained molecular weight markers and 
visualized by autoradiography. 

Glycerol Gradient Sedimentation . Linear gradients of 12 ml, 10-30% glycerol 

25 in a buffer containing 100 mM KCI, 25 mM Tris-HCI (pH 7.6), 0.2 mM EDTA, 5 

mM DTT, and 0.4 mM phenylmethylsulfonyl fluoride, were prepared for 
centrifugation in a Beckman SW40 rotor for 48 h at 4©C. Nuclear extract 
prepared from hypoxic Hep3B cells (100 ul, 5 mg/ml) was mixed with an equal 
volume of glycerol gradient buffer containing 10% glycerol and layered on the top 

30 of the gradient. A marker gradient was sedimented in parallel and contained 50 

ug each of thyroglobulin (660 kDa), ferritin (440 kDa), catalase (232 kDa), 
aldolase (158 kDa), and BSA (67 kDa) (Pharmacia). Markers were adjusted to 
the same volume and glycerol concentration as the sample. Fractions (0.5 ml) 
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were collected from the top of the tubes, and DNA binding activity was measured 
by the gel shift assay. Markers were assayed by SDS-PAGE and silver staining. 

Results. In order to identify polypeptides that are part of the HIF-1 DNA 
binding complex, preparative gel shift assays were performed with affinity-purified 
HIF-I and W18 probe. Gel slices containing the HIF-1 -DNA complex were 
isolated, inserted directly into the wells of an SDS-polyacrylamide gel, and 
analyzed by electrophoresis in parallel with an aliquot of affinity-purified HIF-1 
(Fig. 5A). Four polypeptides present in the HIF-1 complex migrated with an 
apparent molecular weight of 120, 94, 93, and 91 kDa, respectively (Fig. 5A, HIF- 
1). None of these peptides were detected in gel slices isolated from other regions 
of the same lane. These four polypeptides migrated at the same positions as the 
polypeptides that co-purified with HIF-1 DNA binding activity by DNA affinity 
chromatography (Fig. 5A, lane A). The 120 kDa polypeptide and the 91-94 kDa 
polypeptides appear to be present in an equimolar ratio, suggesting that the 120 
kDa polypeptide forms complexes with any one of the 91-, 93-, and 94 kDa 
polypeptides. 

On a 6% SDS-polyacrylamide gel with 3.2% cross-linking, the 120 kDa HIF-1 
polypeptide migrated very close to a contaminant polypeptide of slightly greater 
apparent molecular weight (Fig. 5A, lane A), making isolation of the 120 kDa 
polypeptide difficult. This problem was resolved by separating the HIF-1 
polypeptides on a 6% SDS-polyacrylamide gel with 5% cross-linking. The 120 
kDa polypeptide migrated much faster on the more highly cross-linked gel relative 
to the migration of the 1 16 kDa molecular mass marker, whereas migration of the 
contaminant band (*1) was unchanged (Fig. 5B, lane A). Under these conditions, 
however, the 91 kDa polypeptide ran very close to another contaminant band (*2) 
below it. Two polyacrylamide gel systems with different degrees of crosslinking 
were therefore required for the isolation of the 91-94 kDa and the 120 kDa HIF-1 
polypeptides, respectively. 

To confirm that the HIF-1 polypeptides identified by the two gel systems were 
identical, two dimensional denaturing gel electrophoresis was performed. 
Affinity-purified HIF-1 was first resolved on a 6% SDS-polyacrylamide gel with 5% 
crosslinking (as in Fig. 5B, lane A). Regions of the gel containing the 120 kDa, 
94/93/91-kDa HIF-1 polypeptides, as well as the two contaminant bands, were 
isolated and analyzed by electrophoresis on a 6% SDS-polyacrylamide gel with 
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3.2% crosslinking in parallel with an aliquot of the affinity-purified HIF-I. As shown 
in Fig. 5C, the isolated HIF-1 and contaminant polypeptides co-migrate with the 
corresponding bands in the control sample, indicating that the differences in their 
migration were due to different degrees of cross-linking of the 
5 SDS-polyacrylamide gels. 

To determine whether the four polypeptides from the HIF-I complex represent 
distinct protein species, tryptic peptide mapping was performed. The 91 kDa 
band was isolated individually while the 93 and 94 kDa bands were excised to- 
gether after electrophoretic separation and transfer to a polyvinylidene difluoride 

10 membrane. Proteins were digested with trypsin in situ, and the tryptic peptides 
were separated by reverse phase HPLC (Fig. 6). The elution profiles of tryptic 
peptides derived from 91 kDa protein and 93/94 kDa proteins were nearly 
superimposable (Fig. 6), suggesting that they were derived from similar 
polypeptides. Another aliquot of HIF-1 was resolved on a 6% polyacrylamide gel 

15 of 5% crosslinking for isolation of the 120 kDa HIF-1 polypeptide. The tryptic 

peptide elution profile derived from the 120 kDa polypeptide was distinct from 
those of the 91-94 kDa polypeptides. These results suggest that HIF-1 is 
composed of two different subunits, 120 kDa HIF-1a and 91/93/94 kDa HIF-ip. 
To identify the DNA-binding subunit(s), affinity-purified HIF-1 was incubated 

20 with W18 probe. After UV irradiation to cross-link the DNA-binding proteins to 

nucleotide residues at the binding site, the reaction mixtures were boiled in 
Laemmli buffer and resolved by SDS-PAGE, and cross-linked proteins were 
visualized by autoradiography. Two DNA-binding proteins were detected (Fig. 7, 
lane 1). Their molecular masses were estimated to be approximately 120 and 92 

25 kDa (after the 16 kDa molecular mass contributed by probe DNA was subtracted), 

similar to those of HIF-lct and HIF-1 p. The binding of both proteins to the probe 
was sequence-specific since it could be competed by unlabeled wild-type W18 
(Fig. 7, lane 2) but not mutant M18 (Fig. 7, lane 3) oligonucleotide. These results 
suggest that both HIF-la and HIF-1 (J contact DNA directly. HIF-la was 

30 cross-linked to DNA much more strongly than HIF-1 p (fig. 7, lanes 1 and 3). 

These data provided further evidence that the four polypeptides purified by DNA 
affinity chromatography are bona fide components of HIF-1 DNA binding activity. 

To estimate the native size of HIF-1, glycerol gradient sedimentation analysis 
was performed with crude nuclear extract prepared from hypoxic Hep3B cells. 
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HIF-1 and the constitutive DNA binding activity were monitored by gel shift 
assays. In hypoxic Hep3B nuclear extracts. HIF-I-DNA complexes are present in 
two forms, whereas in CoCI 2 -treated HeLa extracts, the faster migrating form 
predominates. The results, shown in Fig. 8, demonstrate that the two bands of 
the HIF-1 doublet are separable by sedimentation. The faster migrating form was 
estimated to have a molecular mass of approximately 200-220 kDa. Longer 
exposure of the autoradiograph revealed that the slower migrating band co- 
migrated with ferritin, which has a molecular mass of 440 kDa. Assuming a 
globular conformation for both protein complexes, these results are consistent 
with the hypothesis that the faster migrating form represents a heterodimeric com- 
plex, consisting of a 120 kDa HIF-1 a subunit and a 91-94 kDa HIF-lp subunit, 
whereas the slower migrating form may represent a heterotetramer. The exact 
nature and stoichiometry of these HIF-I complexes, however, remains to be 
determined. The constitutive DNA binding activity has a molecular mass less 
than the 67 kDa BSA protein. Since UV cross-linking analysis indicated that the 
constitutive factor has a DNA-binding subunit of approximately 40-50 kDa, it is 
most likely that the constitutive factor binds DNA as a monomer. Consistent with 
the results of glycerol gradient sedimentation analysis. HIF-I eluted from a 
Sephacryl S-300 gel filtration column before the constitutive binding activity, and 
the slower migrating HIF-I gel shift activity eluted before the faster migrating form. 
These results suggest that HIF-I exists predominantly as a heterodimer in solution 
and to a lesser extent as a higher order complex, and that these complexes 
contain at least one HIF-lct and one HIF-1 3 subunit. 

Example 4. Isolation and Characteris ation of HIF-lg C DNA SP T ,on^. 

Protein microsequence analysis . Purified HIF-I subunits were fractionated by 
SDS-polyacrylamide gel electrophoresis, and the 120 and 94 kDa polypeptides 
were transferred to polyvinylidene difluoride membranes, individually digested 
with trypsin in situ and peptides were fractionated by reverse-phase high-pressure 
liquid chromatography (Wang & Semenza (1995) J. Biol. Chem. 270:1230-1237, 
herein specifically incorporated by reference). Protein microsequence analysis 
was performed at the Wistar Protein Microchemistry Laboratory, Philadelphia 
(Best et al. (1994) supra). 

cDNA library construction and screening . Poly (A)+ RNA was isolated from 
Hep3B cells cultured for 16 h at 37 °C in a chamber flushed with 1% 0^5% 
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CO^balance N 2 . cDNA was synthesized using oligo(dT) and random hexamer 
primers and bacteriophage libraries were constructed in Agt1 1 and Uni-ZAP XR 
(Stratagene, La Jolia CA). cDNA libraries were screened with 32 P-labelled cDNA 
fragments by plaque hybridization as described (Sambrook et al. (1989) Molecular 
5 Cloning: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, 

Plainview, NY, herein specifically incorporated by reference). 

PCR . Degenerate oligonucleotides primers were designed using codon 
preference rules (Lathe (1985) J. Mol. Biol. 183:1-12). ccF1 

(5 , -ATCGGATCCATCACIGA(A/G)CT(C/G)-ATGGGITATA-3 , ) (SEQ ID NO:7) was 

10 based upon the amino terminus of HIF-la peptide 87-1 and used as a forward 
primer. Two nested reverse primers, aR1 (5-ATTAAGCmTGGT- 
(G/C)AGGTGGTCI(G/C)(A/T)GTC-3') (SEQ ID NO:8) and ccR2 (5'- 
ATTAAGCTTGCATGGTAGTA(T/C)TCATAGAT-3 , ) (SEQ ID NO:9), were based 
upon the carboxy terminus of peptide 91-1. PCR was performed by: 

15 denaturation of 108 phage or 10 ng of phage DNA at 95°C for 10 min; addition of 

AmpliTaq (Perkin-Elmer) at 80°C; and amplification for 3 cycles at 95°C, 37°C, 
and 72°C (30 sec each) followed by 35 cycles at 95°C, 50°C, and 72°C (30 sec 
each). Nested PCR with ccF1/aR1 and then ccF1/aR2 generated an 86-bp 
fragment which was cloned into pGEM4 (Promega). For HIF-ip (ARNT), PCR 

20 was performed as described above using primers 

5 , -ATAAAGCTTGT(C/G)TA(CyT)GT-(C/G)TCIGA(CyT)TCIG-3 , (SEQ ID NO: 1 0) 
and 5* ATCG AATTC(C7T)TC I -G ACTG I GGCTGGTT-SXS EQ ID NO:11) which 
resulted in the predicted 69-bp product. For analysis of the 5' end of HIP-1 p 
(ARNT), Hep3B poly(A)+ RNA was reverse-transcribed using reagents from a 

25 5-RACE kit (Clontech). The cDNA was used as template to amplify nt 54-425 of 

ARNT cDNA (Hoffman et al. (1991) supra) , with 

S'-TACGGATCCGCCATGGCGGCGACT-ACTGA-S' (SEQ ID NO:12) (forward 
primer) and nested reverse primers S-AGCCAGGGCACTACAGGTGGGTACC-S' 
(SEQ ID NO:13) and 5 , GTTCCCCGCAAGGACTTCATGTGAG-3 , (SEQ ID NO:14) 
30 for 35 cycles at 95°C, 60°C, and 72°C (30 sec each). PCR products were cloned 

into pGEM4 for nucleotide sequence analysis. 

Results . The purified 120 kDa HIF-la polypeptide was digested with trypsin, 
peptides were fractionated by reverse-phase high-pressure liquid chromatography 
and fractions 87 and 92 were subjected to microsequencing. Each fraction 
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contained two tryptic peptides, for which virtually complete amino acid sequences 
were obtained: ITELMGYEPEELLGR (SEQ ID NO: 15) (87-1), XIILIPSDLAXR 
(SEQ ID NO:16) (87-2), SIYEYYHALDSDHLTK (SEQ ID NO:17) (91-1), and 
SFFLR (SEQ ID NO: 18) (91-2). When 87-1 and 91-1 were entered as contiguous 
sequences, database searches identified similarities to the Drosophila proteins 
period (PER) and single-minded (SIM), and the mammalian aryl hydrocarbon 
receptor (AHR) and aryl hydrocarbon receptor nuclear translocator (ARNT) 
proteins, which all contain sequences of 200-350 amino acids that constitute the 
PAS (PER-ARNT-AHR-SIM) domain (Hoffman et al. (1991) Science 252:954-958; 
Citri et al. (1987) Nature 326:42-47; Burbach et al. (1992) Proc. Natl. Acad. Sci. 
USA 89:8185-8189; Crews et al. (1988) Cell 52:143-151; Nambu et al. (1991) Cell 
67:1157-1167). Degenerate oligonucleotides were synthesized based upon the 
87-1 and 91-1 sequences and used for PCR with cDNA prepared from hypoxic 
Hep3B cells. Nucleotide sequence analysis revealed that the cloned PCR product 
encoded the predicted amino acids, demonstrating that 87-1 and 91-1 were 
contiguous peptides. 

Example 5. Nucleotide sequenc e and database analysis - Complete 

unambiguous double stranded nucleotide sequences were obtained by 
incorporation of fluorescence-labeled dideoxy nucleotides into thermal-cycle 
sequencing reactions using T3, T7, and custom-synthesized primers. Reactions 
were performed using Applied Biosystems 394 DNA Synthesizers and 373a 
Automated DNA Sequencers in the Genetics Core Resources Facility of The 
Johns Hopkins University. Protein and nucleic acid database searches were 
performed at the National Center for Biotechnology Information using the 
programs BLASTP and TBLASTN (Altschul et al. (1990) J. Mol. Biol. 215:403- 
410). The HIF-lcc cDNA nucleotide sequence and deduced amino acid sequence 
have been submitted to GenBank. The accession number is U22431 . 

Results. Database analysis also identified an expressed-sequence tag (EST) 
whose derived amino acid sequence showed similarity to bHLH-PAS proteins. 
We obtained the 3.6-kb cDNA from which the EST was derived, hbc025 (Takeda 
et al. (1993) Hum. Mol. Genet. 2:1793-1798). Complete nucleotide sequence 
analysis revealed that it encoded all four tryptic peptides. Another EST was 
identified which shared identity with hbc025 and was encoded by a 2.0-kb cDNA, 
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hbc120 (Takeda et a!. (1993) suoraV Sequence analysis of hbc120 revealed that 
it was co-linear with the 3* end of hbc025 (Fig. 9), differing only in the length of the 
poly (A) tail. The 5' end of hbc025 was used to screen a Hep3B cDNA library, 
resulting in the isolation of an overlapping 3.4-kb cDNA, 3.2-3, which extended to 
5 an initiator codon. The composite cDNA of 3720 bp encoded a 2478-bp open 

reading frame that included a translation initiation codon, a 28-bp S'-untranslated 
region (5*-UTR) that contained an in-frame termination codon, and a 121 1-bp 3'- 
UTR that ended with a canonical polyadenylation signal followed after 12 bp by 43 
adenine residues. Compared to the consensus translation-initiation sequence 

10 GCC(A/G)CCATGG (SEQ ID NO: 19) (Kozak (1987) Nucleic Acids Res. 

15:8125-8132), the HIF-la cDNA sequence is TTCACCATGG (SEQ ID NO:20). 
The HIF-1a cDNA open reading frame predicted a novel 826 amino acid 
polypeptide (Fig. 10) with a molecular mass of 93 kDa that contained a 
bHLH-PAS domain at its amino terminus. 

15 Analysis of two tryptic peptides isolated from the 94 kDa HIF-13 polypeptide 

(Wang & Semenza (1995) supra) yielded partial amino acid sequences, 
WYVSDSVTPVLNQPQSE (SEQ ID NO:21) and 

TSQFGVGSFQTPSSFSSMXLPGAPTASPGAAAY (SEQ ID NO:22). Using 
degenerate oligonucleotides based upon the second peptide sequence, a PCR 

20 product of the predicted size was amplified from Hep3B cDNA. Database 

searches identified both peptides within the sequence of ARNT, a bHLH-PAS 
protein previously shown to heterodimerize with AHR to form the functional dioxin 
receptor (Reyes et al. (1992) Science 256:1 193-1 195). Two isoforms of ARNT 
have been identified which differ by the presence or absence of a 15 amino acid 

25 sequence encoded by a 45-bp alternative exon (Hoffman et al. (1 991 ) supra l 

Analysis of Hep3B RNA by reverse transcriptase-PCR revealed the presence of 
both sequences, as well as additional isoforms. These primary sequence 
differences may account for the purification of three (91,93, and 94 kDa) HIF-lp 
polypeptides (Wang & Semenza (1995) supra) . The apparent molecular mass of 

30 both HIF-la and HIF-1p on denaturing gels was greater than the mass predicted 

from the cDNA sequence. For HIF-la the apparent mass was 120 kDa compared 
to a calculated mass of 93 kDa; for the HIF-10 subunits, the apparent masses 
were 91-94 kDa compared to calculated masses of 85 and 87 kDa for the 774 
and 789 amino acid isoforms of ARNT, respectively. The HIF-la and ARNT 
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sequences contain multiple consensus sites for protein phosphorylation and HIF-1 
has been shown to require phosphorylation for DNA binding (Wang & Semenza 
(1993b) supra). 

HIF-1 a and HIF-1 p (ARNT) belong to different classes of bHLH domains, 
which consist of contiguous DNA binding (b) and dimerization (HLH) motifs. The 
bHLH domain of HIF-1 a is most similar to the other bHLHPAS proteins, SIM and 
AHR (Fig. 11). HIF-1 p (ARNT) has greatest similarity to the bHLH domains found 
in a series of mammalian (Ml, USF, L-MYC) and yeast (CP- 1) proteins that bind 
to 5-CACGTG-3' (SEQ ID NO:23) (Dang et al. (1992) Proc. Natl. Acad. Sci. USA 
89:599-603), a sequence which resembles the HIF-1 [5'-(G/Y)ACGTGC(G/T)-3' 
(SEQ ID NO:24) (Semenza et al. (1994) supra) ] and dioxin receptor 
[S'-CnONGCGTCfA/CHG/OA-S 1 (SEQ ID NO:25) (Lusska etal. (1993) J. Biol 
Chem. 268:6575-6580)] binding sites. These transcription factors share bHLH 
domains of related sequence which occur in different dimerization contexts: Ml, 
1 5 L-MYC. and USF are bHLH-leucine zipper proteins, ARNT is a bHLH-PAS 

protein, and CP-1 contains only a bHLH domain. 

Analysis of PAS domains, which have been implicated in both ligand binding 
and protein dimerization (Huang et al. (1993) Nature 364:259-262; Dolwick et al. 
(1993) Proc. Natl. Acad. Sci. USA 90:8566-8570, Reisz-Porszasz et al. (1994) 
20 Mol. Cell. Biol. 14:6075-6086), revealed that HIF-1a is most similar to SIM. Our 

alignment established consensus sequences that include a previously unreported 
motif, HXXD, present in the A and B repeats of all PAS proteins (Fig. 12). We 
also found that KinA of Bacillus subtilis (Perego et al. (1989) J. Bacteriol. 
171:6187-6196) contains a PAS domain at its amino terminus and is thus the first 
procaryotic member of this protein family, indicating a remarkable degree of 
evolutionary conservation. KinA, like PER, possesses a PAS but not a bHLH 
domain and is thus unlikely to bind DNA. B. subtilis undergoes spoliation in 
response to adverse environmental conditions and KinA functions as a sensor 
that transmits signals via a carboxy-terminal kinase domain (Burbulys et al. (1991) 
30 Cell 64, 545-552). 

Example 6. RNA Blot Hybridization 

The expression of HIF-1 RNAs in response to inducers of HIF-1 DNA-binding 
activity was analyzed as follows. 



25 
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Total RNA (15 ug) was fractionated by 2.2 M formaldehyde/ 1.4% agarose 
gel electrophoresis, transferred to nitrocellulose membranes and hybridized at 
68°C in Quik-Hyb (Stratagene) to 32 P-labelled HIF-1 a or ARNT cDNA. Gels were 
stained with ethidium bromide and RNA was visualized by ultraviolet illumination 
5 before and after transfer to insure equal loading and transfer, respectively, in 

each lane. Based upon the migration of RNA size markers (BRL-GIBCO) on the 
same gels, the size of HIF-la RNA was estimated to be 3.7 t 0.1 kb. Two ARNT 
RNA species were identified as previously reported (Hoffman et al. (1991) supra V 
Results . When Hep3B cells were exposed to 1% O z , HIF-1cc and HIF-1p 

10 (ARNT) RNA levels peaked at 1-2 h, declined to near basal levels at 8 h, and 
showed a secondary increase at 16 h of continuous hypoxia (Fig. 13A). In 
response to 75 uM CoCI 2 , HIF-1 RNAs peaked at 4 h, declined at 8 h, and 
increased again at 16 h (Fig. 13B). In cells treated with 130 uM desferoxamine, 
a single peak at 1-2 h was seen (Fig. 13C). When cells were incubated at 1% 0 2 

15 for 4 h and then returned to 20% 0 2 , both HIF-1a and HIF-1p RNA decreased to 

below basal levels within 5 min, the earliest time point assayed (Fig. 13D). These 
results demonstrate that, as in the case of HIF-1 DNA-binding activity (Wang & 
Semenza (1993b) supra ). HIF-1 RNA levels are tightly regulated by cellular 0 2 
tension. The marked instability of HIF-1 a RNA in posthypoxic cells may involve 

20 the 3-untranslated region (3-UTR) which contains eight AUUUA sequences (Fig. 

13E) that have been identified in RNAs with short half-lives and shown to have a 
destabilizing effect when introduced into heterologous RNAs (Shaw & Kamen 
(1986) Cell 46:659-667). Seven of the HIF-1a AUUUA sequences conform to a 
more stringent consensus for RNA instability elements, 

25 5'-UUAUUUA(U/A)(U/A)-3 f (SEQ ID NO:26) (Lagnado et al. (1994) Mol. Cell. Biol. 

14:7984-7995). 

Example 7. Antibody Production . 

To analyze HIF-1 protein expression, polyclonal antisera was raised against 
HIF-1 a and HIF-1 (3 as follows. 
30 Rabbits were immunized with recombinant proteins in which 

glutathione-S-transferase (GST) was fused to amino acids 329-531 of HIF-la or 
496-789 of ARNT. To generate antibodies against HIF-1a, a 0.6 kb EcoRI 
fragment from hbc025 was cloned into pGEX-3X (Pharmacia) and transformed 
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into E. co//DH5a cells (GIBCO-BRL). GST/HIF-1a fusion protein was isolated by 
exposure of bacteria (OD^ = 0.8) to 0.1 mM IPTG at room temperature for 1 h; 
sonication in 50 mM Tris-HCI (pH 7.4), 1 mM EDTA, 1 mM EGTA, I mM 
phenylmethylsulfonyl fluoride; centrifugation at 10,000 x g for 10 min; incubation 
of supernatant with glutathione-agarose (Pharmacia) in the presence of 1% NP- 
40 for 1 h at 4°C; and elution with 5 mM reduced glutathione, 50 mM Tris-HC1 
(pH 8.0). 150 mM NaCI. To generate antibodies against HIF-lp, ARNT nt 
1542-2428 were amplified from Hep3B cDNA by PCR with Taq polymerase using 
forward primer 5'-ATAGGATCCTCAGGTCAGCTGGCACCCAG-3" (SEQ ID 
NO:27) and reverse primer 5'-CCAAAGCTTCTATTCTGAAAAGGGGGG-3' (SEQ 
ID NO:28). The product was digested with BamHI and EcoRI, to generate a 
fragment corresponding to ARNT nt 1542-2387, and cloned into pGEX-2T 
(Pharmacia). Fusion protein isolation was as described above, except that 
induction was with 1 mM IPTG for 2 h and binding to glutathione-agarose was in 
the presence of 1% Triton X-100 rather than NP-40. Fusion proteins were 
excised from 10% SDS/polyacrylamide gels and used to immunize New Zealand 
white rabbits (HRP Inc., Denver PA) according to an institutionally-approved 
protocol. Antibodies raised against HIF-lcc were affinity-purified by binding to 
GST/HIF-la coupled to CNBr-activated Sepharose 4B (Pharmacia). 

Results. Antisera was used to demonstrate that the proteins encoded by the 
cloned HIF-1a cDNA and ARNT are components of HIF-I DNA-binding activity 
(Fig. 14A). When crude nuclear extracts from hypoxic cells were incubated with 
probe DNA and either antiserum, the HIF-I/DNA complex seen in the absence of 
antisera was replaced by a more slowly migrating HIF-l/DNA/antibody complex, 
whereas addition of preimmune sera had no effect on the HIF-1/DNA complex. 

Example 8. Immunoblot analysis . 

15 ug aliquots of nuclear protein extracts were resolved on 6% 
SDS/polyacrylamide gels and transferred to nitrocellulose membranes in 20 mM 
Tris-HC1 (pH 8.0), 150 mM glycine, 20% methanol. Membranes were blocked 
with 5% milk/TBS-T [20 mM Tris-HCI (pH 7.6), 137 mM NaCI. 0.1% Tween-20], 
incubated with aff.nity-purif.ed HIF-la antibodies or H I F- 1 p antiserum diluted 1:400 
or 1:5000. respectively, washed, incubated with horseradish peroxidase 
anti-immunoglobulin conjugate diluted 1:5000, washed, and developed with ECL 
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reagents (Amersham) and autoradiography. Incubations were for 1 h in 5% 
milk/TBS-T and washes were for a total of 30 min in TBS-T at room temperature. 

Results . Immunoblot analysis revealed that the antisera detected 
polypeptides in crude nuclear extracts from hypoxic Hep3B or CoCI 2 -treated HeLa 
5 cells which co-migrated with polypeptides present in purified HIF-I protein 

preparations (Fig. 14B). Analysis of nuclear and cytoplasmic extracts prepared 
from Hep3B cells exposed to 1% 0 2 (Fig. 14C) revealed that peak levels of HIF- 
1a and HIF-1 p were present in nuclear extracts at 4-8 h of continuous hypoxia, 
similar to the induction kinetics of HIF-1 DNA-binding activity (Wang & Semenza 

10 (1993) J. Biol. Chem. 268:21513-21518). For HIF-la, the predominant protein 
species accumulating at later time points migrated to a higher position in the gel 
than protein present at earlier time points, suggesting that post-translational 
modification of HIF-1 a may occur. For HIF-1 p, the 94- and 93 kDa species were 
resolved from the 91 kDa form but not from each other and no shifts in migration 

15 were seen. The post-hypoxic decay of HIF-1 proteins was also remarkably rapid 

(Fig. 14D), indicating that, as with the RNAs, these proteins are unstable in post- 
hypoxic cells. For both HIF-1ct and ARNT, 31% of all amino acids are proline, 
glutamic acid, serine, or threonine (PEST) residues, which have been implicated 
in protein instability (Rogers et al. (1986) Science 234:364-368). In HIF-la, two 

20 20 amino acid sequences (499-518 and 581-600; Fig. 10) each contain 15 PEST 

residues. For HIF-1 p (ARNT), redistribution between nuclear and cytoplasmic 
compartments also appeared to play a role in both the induction and decay of 
nuclear protein levels. 

Together with our previous studies of HIF- 1, the results presented here 

25 indicate that HIF- 1 is a heterodimeric bHLH-PAS transcription factor consisting of 

a 120 kDa HIF-la subunit complexed with a 91-94 kDa HIF-1P (ARNT) isoform. 
Thus, ARNT encodes a series of common subunits utilized by both HIF-1 and the 
dioxin receptor, analogous to the heterodimerization of E2A gene products with 
various bHLH proteins (Murre et al. (1989) Cell 58:537-544). Based upon these 

30 results and the similarity of HIF-la and SIM within the bHLH-PAS domain, ARNT 
may also heterodimerize with SIM. In Drosophila, several SIM-regulated genes 
are characterized by enhancer elements that include I-5 copies of the sequence 
5 , -(G/A)(T/A)ACGTG-3 f (SEQ ID NO:29)(Wharton et al. (1994) Development 
120:3563-3569). The observation that the HIF-1, dioxin receptor, and SIM 
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binding sites share the sequence 5'-CGTG-3' supports the hypothesis that ARNT 
is capable of combinatorial association with HIF-1ct, AHR, and SIM since this 
half-site is also recognized by the transcription factors with which ARNT shows 
greatest similarity in the bHLH domain. 

Example 9. Transcriptional Regulat ion ofGenss Encoding Glycoly tic 

Enzvmes by HIF-1 . 

The involvement of HIF-1 in transcriptional regulation of genes encoding 
glycolytic enzymes in hypoxic cells was investigated as follows. 

RNA analysis. Total RNA was isolated from Hep3B and HeLa cells 
(Chomczynski & Sacchi (1987) Anal. Biochem 162:156-159). RNA 
concentrations were determined by absorbance at 260 nm. Agarose gel 
electrophoresis, followed by ethidium bromide staining and visualization of 28 and 
18 S rRNA under UV illumination, confirmed that aliquots from different 
preparations contained equal amounts of intact total RNA. Plasmids N-KS+ and 
H-KS + , provided by P. Maire (Institut Cochin de Genetique Moleculaire, Paris), 
were linearized by digestion with Hindlll. Antisense RNA was synthesized by T3 
RNA polymerase in the presence of 

[a- 32 PIATP. 10 ug of total cellular RNA was hybridized to H or N riboprobe (3 x 
10 5 cpm) for 3 h at 66oC and digested with RNases A and T,; protected fragments 
were analyzed by 8 M urea, 8% polyacrylamide gel electrophoresis (Semenza et 
al. (1990) Mol. Cell. Biol. 10:930-938). Human phosphoglycerate kinase 1 (PGKI) 
cDNA from plasmid pHPGK-7e (Michelson et al. (1985) Proc. Natl. Acad. Sci. 
USA 82:6965-6969), obtained from American Type Culture Collection, and rat 
PKM cDNA from plasmid pM2PK33 (Noguchi et al. (1986) J. Biol. Chem. 
261:13807-13812), provided by T. Noguchi (Osaka University Medical SchooL 
Osaka. Japan), were used as random-labeled probes for blot hybridizations 
performed in QuikHyb (Stratagene) for 1 h at 68 °C, followed by washing in 15 
mM sodium chloride, 1.5 mM sodium citrate, 0.1% SDS at 50 °C. Densitometric 
analysis of autoradiograms was performed with an LKM Ultroscan XL laser 
30 densitometer using computerized peak integration. 

Electrophoretic Mobility Shift Assay fFMSA) Crude nuclear extract 
preparations, conditions of probe preparation, binding reactions, and gel analysis 
were all previously described above. Double-stranded oligonucleotides were 
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synthesized according to the sequences shown in Table 2 except that each 
oligonucleotide contained at its 5-end the sequence 5'-GATC-3\ which formed a 
single-stranded 5' overhang when complementary oligonucleotides were 
annealed. The sense strand sequence of the W18 and M18 oligonucleotides was 
5 as given above. HIF-1 was partially purified from 50 liters of CoCI 2 -treated HeLa 

cells by crude nuclear extract preparation, DEAE-Sepharose chromatography, 
MonoQ fast protein liquid chromatography, and DNA affinity chromatography. 
Incubations with crude nuclear extracts and partially purified HIF-I contained 100 
and 1 ng of denatured calf thymus DNA, respectively. Competition experiments 

10 were performed with 5 ng of unlabeled W18 or MI8 oligonucleotide. 

Tissue culture . Hep3B and HeLa cells were maintained in culture and treated 
with 1% 0 2 , CoCI 2 , DFX, and cycloheximide (CHX) as described above. 

Transient Expression Assay . The psvcat reporter plasm id (pCAT Promoter, 
Promega) contained SV40 early region promoter, bacterial chloramphenicol 

15 acetyltransferase (CAT) coding sequences, SV40 splice, and polyadenylation 
signals. Oligonucleotides were cloned into the Bglll and BamHI sites located 5' 
and 3' to the transcription unit, respectively. Plasmids pNMHcat and pHcat 
(Concordet et al. (1991) Nucleic Acids Res. 19:4173-4180), containing human 
aldolase A gene sequences fused directly to CAT coding sequences, were 

20 provided by P. Maire. pSVpgal (Promega) contained bacterial lacZ coding 

sequences driven by the SV40 early region promoter and enhancer. Plasmids 
were purified by alkaline lysis and two rounds of cesium chloride density gradient 
centrifugation. Hep3B cells were transfected by electroporation with a Gene 
Pulser (Bio-Rad) at 260 V and 960 microfarads. Duplicate electroporations were 

25 pooled and split onto two 10 cm tissue culture dishes (Corning) containing 8 ml of 

media. Cells were allowed to recover for 24 h in a 5% C0 2 95% air incubator at 
37°C, the media was replaced, and one set of duplicate plates was removed to a 
modular incubator chamber, which was flushed with 1% 0 2 , 5% C0 2l balance N 2> 
sealed, and placed at 37°C. Cells were harvested 72 h after transfection, and 

30 extracts were prepared for CAT and p-galactosidase activity. 

Results . The human aldolase A gene (hALDA) contains four noncoding 
exons, N1, N2, M, and H (Maire et al. (1987) J. Mol. Biol. 197:425-438). 
Transcription is initiated at exons N1 and H in most tissues other than muscle. 
Ribonuclease protection assays of RNA isolated from cells exposed to 20 or 1% 
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0 2 for 16 h revealed 3.0- and 2.9-fold higher levels of ALDA RNA initiated from 
exon H in Hep3B and HeLa cells exposed to 1% 0 2> whereas RNA initiated from 
exon N1 increased only 1.7- and 1.1 -fold in hypoxic Hep3B and HeLa cells, 
respectively, suggesting a promoter-specific response to hypoxia. 
5 We next compared the expression of ALDA and phosphoglycerate kinase 1 

(PGKI) RNAin Hep3B cells exposed to 1% 0 2 for 0-16 h. Maximal induction of 
both ALDA and PGK1 RNA showed delayed kinetics, suggesting a requirement 
for protein synthesis during induction, which was confirmed by the demonstration 
that treatment of Hep3B cells with 100 uM CHX decreased induction of ALDA and 
10 PGK1 RNA in hypoxic cells from 6.1- and 8.2-fold to 1.6- and 1.4-fold, 

respectively. 

Treatment of Hep3B cells for 16 h with 75 uM CoCI 2 or 130 uM DFX induced 
both ALDA and PGK1 RNA with ALDA transcripts preferentially initiated from 
exon H. Analysis of the same RNA samples with a probe for PKM revealed that 
PKM RNA was also induced by exposure of Hep3B cells to 1% 0 2 , CoCI 2 , or DFX. 
ALDA, PGK1, and PKM RNAs were also induced by treatment of HeLa cells with 
1% 0 2 , CoCI 2 , or DFX. PFKL RNA was not expressed at detectable levels in 
Hep3B or HeLa cells. These RNA analyses demonstrate that agents that induce 
EPO RNA and HIF-1 activity also induce ALDA, PGK1, and PKM RNA in both 
20 EPO-producing Hep3B and nonproducing HeLa cells, with a requirement for de 

novo protein synthesis, as previously demonstrated for induction of EPO RNA and 
HIF-1 activity (Semenza & Wang (1992) Mol. Cell. Biol. 12:5447-5454). 

Nucleotide sequences of genes encoding glycolytic enzymes present in Gen- 
Bank were searched for potential HIF-1 binding sites using the query sequence 
25 5'-ACGTGC-3\ which contains the 4 guanine residues that contact HIF-1 in the 

DNA major groove (Wang & Semenza (1993b) supra). Double-stranded 
oligonucleotides were synthesized corresponding to S'-flanking sequences (5'-FS) 
of the human PGK1 (hPGKI), human enolase 1 (hENOi), and mouse LDHA 
(mLDHA) genes; 5-untranslated sequences (5'-UT) of hPGKI; and intervening 
30 sequences (IVS) of the hALDA and mPFKL genes. These oligonucleotides 

contained, as potential HIF-1 sites, 5'-TACGTGCT-3" (SEQ ID NO:30). 
5-GACGTGCG-3 1 (SEQ ID NO:31) (which was also found in hEPO 5'-FS). and 
5-CACGTGCG-3' (SEQ ID NO:32). The first sequence is identical to the 
previously identified HIF-1 binding site in the EPO enhancer (Semenza & Wang 
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(1992) supra) , whereas the latter two sequences differ at the first and last 
nucleotides. The ability of these oligonucleotides to bind HIF-1 was tested by 
EMSA. 

When incubated with nuclear extract prepared from Hep3B cells exposed to 
5 1% 0 2 for 4 h ( each probe generated a DNA protein complex of similar mobility 

and intensity to the HIF-1 complex formed with probe W18, corresponding to 
nucleotides 1-18 of the hEPO 3 -FS. In contrast, none of these probes detected 
an HIF-1 complex in nuclear extracts from cells maintained at 20% 0 2 , although 
the EMSA patterns were otherwise similar to those obtained with nuclear extracts 

10 from hypoxic cells. The DNA-protein complex migrating below the HIF-1 complex 
was less intense when hypoxic (compared with non-hypoxic) nuclear extracts 
were assayed. We have previously shown that this complex contains a 
constitutively expressed factor that recognizes the same DNA sequence as HIF-1 
(Wang & Semenza (1993b) supra) . The decreased binding of the constitutive 

15 factor may thus result from competition for binding with HIF-1 in hypoxic extracts. 

EMSA was also performed with a preparation of HIF-1 from CoCI 2 -treated 
HeLa cells that was purified approximately 600-fold by DEAE-cellulose, MonoQ, 
and DNA affinity chromatography. Each probe bound HIF-1 in a manner that was 
qualitatively and quantitatively similar to the complex formed with W18. The 

20 binding of HIF-1 to these probes was sequence-specific as it could be competed 

by an excess of unlabeled W18 but not by mutant oligonucleotide M18, containing 
a 3-nucleotide substitution previously shown to eliminate HIF-1 binding and 
hypoxia-inducible enhancer function. Similar results were obtained when 
competition experiments involving W18 and M18 were performed with crude 

25 nuclear extract from hypoxic Hep3B cells. These results identify novel HIF-1 

binding sites in genes encoding ALDA, ENOI, PFKL, and PGKI as well as in the 
hEPO 5-FS. The 8 oligonucleotides that have been shown to specifically bind 
HIF-1 (Table 2) contain 3 different binding site sequences that are represented by 
the consensus 5 , -(C/G/T)ACGTGC(G/T)-3 , (SEQ ID NO:33). Given the biased 

30 method of ascertainment, it is possible that HIF-1 may recognize other sequences 

not represented by this consensus. In addition to the 6 HIF-1 sites from glycolytic 
genes, the sequence 5-CACGTGCT-3 1 (SEQ ID NO:34) was also present in the 
hENOI 5'-FS at -786 to -793 (Gialongo et al. (1990) Eur. J. Biochem. 190:567- 
573) but was not tested for HIF-1 binding. Thus, a total of 7 probable HIF-1 sites 
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were identified in 20.7 kb of nucleotide sequence reported to GenBank for these 5 
glycolytic genes. In contrast, no sequences matching the consensus HIF- 1 site 
were identified on either DNA strand within a total of 43.5 kb, comprising the 
nucleotide sequences of 5 randomly chosen genes, AFP, BUP4, CREB, DHFR, 
and EPOR (Gibbs et al. (1987) Biochemistry 26; 1332-1 343; Kurihara et al. (1993) 
Biochem. Biophys. Res. Commun. 192:1049-1056; Meyer et ai. (1993) 
Endocrinology 132:770-780; Mitchell et all. (1986) Mol. Cell. Biol. 6:425-440; 
Noguchi et al. (1991) Blood 78:2548-2556). 

To determine whether these HIF-1 binding sites were of functional impor- 
tance, transient expression essays were performed using the reporter genes 
described above. Reporter plasmids were cotransfected into Hep3B cells with 
pSVpgal, which was included as a control for variation in transfection efficiency. 
Transfected cells were split among duplicate plates that were cultured in 1 or 20% 
0 2 for 48 h, CAT and p-galactosidase protein synthesized following transcription 
of reporter and control plasmids, respectively, were quantitated from cellular 
extracts. The basal reporter psvcat, in which transcription of CAT coding se- 
quences was driven by the SV40 early region promoter, generated similar 
CAT/p-galactosidase values in cells cultured at 1 and 20% 0 2 . When one 
(psvcatEPOl) or two (psvcatEP02) copies of the 33-base pair hEPO 3'-FS 
enhancer were cloned 3' to the transcription unit, CAT/p-galactosidase expression 
was induced 4.9- and 17-fold, respectively, in cells cultured at 1% 0 2) consistent 
with previously reported results (Semenza & Wang (1992) su^ra). 

HIF-1 binding site sequences from glycolytic genes were analyzed in the 
same assay. The mPFKL IVS-1 and hPFK1 S^-FS oligonucleotides were chosen, 
as they represented sequences identical to or divergent from the HIF-1 site in the 
hEPO 3-FS and were located 3* or 5* to the transcription initiation site, 
respectively. Two copies of the 24-base pair hPGK1 5'-FS oligonucleotide were 
cloned 5' to the psvcat transcription unit (Fig. 15A), analogous to its location in 
hPGK1 . Expression of pPGK2svcat was induced 5.6-fold in hypoxic cells (Fig. 
15B). Three copies of the 26-base pair mPFK1 IVS-1 oligonucleotide were also 
cloned 5' to the psvcat transcription unit, and pPFKL3svcat mediated a 47-fold 
induction in hypoxic cells (Fig. 15B). 

We also performed experiments with hALDA gene sequences to analyze 
native promoter function and to correlate sequence requirements for induction in 
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the transfection assay with endogenous RNA expression data. The plasmid 
pNMHcat (Concordet et al. (1991) syEra), in which 3.5 kb from the 5*-end of 
hALDA (Maire et al. (1987) sugra) was fused to CAT coding sequences (Fig. 
15A), mediated a 5.5-fold induction in hypoxic cells (Fig. 15B). The plasmid 
5 pHcat contained 0.76 kb of hALDA sequences that are colinear with the 3'-end of 

pNMHcat, starting within IVS-4 and extending 5* to exon H (Fig. 15A). Deletion of 
exons N1, N2, and M and their flanking sequences resulted in 20-fold increased 
levels of CAT expression but had no significant effect on relative expression in 1% 
0 2l as pHcat was induced 5.4-fold in hypoxic Hep3B cells (Fig. 15B). These 

10 results are consistent with the observation of (i) specific induction of hALDA 

transcripts initiated from exon H and (ii) the presence of a HIF-1 binding site at 
the 5' end of IVS-4 contained within both pNMHcat and pHcat. Thus, sequences 
containing HIF-1 sites from the mPFKL, hPGK1, and hALDA genes mediated 
hypoxia-inducible transcription in conjunction with either a native or heterologous 

15 promoter. 

Example 10. Construction of a Dominant-Negative Variant of HIF-1 a . 

A HIF-1 a variant was constructed to investigate functional inactivation of HIF- 

1. 

The starting construct was the HIF-1 a cDNA 3.2-3 cloned into the plasmid 
20 pBluescript SK-. This plasmid was digested with the restriction endonucleases 

Ncol and Bglll to delete sequences encoding amino acids 2-28. A double- 
stranded oligonucleotide was inserted that contained Ncol and Bglll ends to allow 
recirculation of the plasmid in the presence of T4 DNA ligase. The resulting 
construct encodes amino acids 1-3, followed by three amino acids not present in 
25 the corresponding position in wild-type HIF-1 a (isoleucine, alanine, and glycine), 

followed by amino acids 28-826 of HIF-1 a. This construction (pBluescript/HIF- 
1a3.2T7ANB) allows the in vitro transcription (using T7 RNA polymerase) and 
translation of the variant form of HIF-1 a (HIF-1 aANB) (SEQ ID NO:35). 

To create a dominant negative form of HIF-1 a for expression in mammalian 
30 tissue culture cells, a Kpn l-Not I fragment encoding the variant cDNA was 

excised from the pBluescript vector and cloned into the mammalian expression 
vector pCEP4. The plasmid was digested with Aflll and BamHI, treated with 
Klenow form of DNA polymerase to generate blunt ends, and recircularized with 
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T4 DNA ligase. The resulting plasmid (pCEP4/HIF-1 aANBAAB) (SEQ ID NO:3) 
encodes amino acids 1-3, followed by three amino acids not present at the 
corresponding position in wild-type HIF-1a (isoleucine. alanine, and glycine), 
followed by amino acids 28-391 of HIF-1a, followed by three amino acids not 
5 present at the corresponding position in wild-type HIF-1cc (isoleucine, glutamine, 
and threonine). Amino acids 392-826 were deleted to increase the stability of the 
variant protein (HIF-1 aANBAAB) expressed in cells (Fig. 16). 

Results. Hep3B cells were transiently transfected with 25 ug of the reporter 
gene psvcatEP02 which contains two copies of the 33-bp enhancer sequence 
10 from the human erythropoietin gene as described above. This plasmid expressed 
a 9-fold higher level of CAT protein when cells were cultured at 1% O z relative to 
20% O z . When the cells were transfected with psvcatEP02 and pCEP4/HIF- 
1 aANBAAB, there was dose-dependent inhibition of CAT expression at 1% 0 2 . 
Table 3 shows the relative induction (expression at 1% O z divided by expression 
15 at 20% 0 2 ) as a function of the amount of pCEP4/HIF-1 aANBAAB (ug) 

transfected into the cells. Results are the mean of three experiments. 

Expression of variant HIF-1 a interfered with the activation of reporter gene 
expression by endogenous HIF-1 produced by hypoxic cells. The residual 
activation seen with 40 ug variant transfection may represent cells which took up 
20 psvcatEP02 but not pCEP4/HIF-1 aANBAAB. The results show that the 

dominant-negative variant can interfere with HIF-1 function in vivo. 

The variant protein was used in a electrophoretic mobility shift assay of 
binding to a double-stranded oligonucleotide probe containing the HIF-1 binding 
site from the EPO enhancer. pBluescript/HIF-1a3.2T7ANB was used as a 
25 template for in vitro transcription and translation. As increasing amounts of 

pBluescript/HIF-1a3.2T7ANB were added to reactions containing a constant 
amount of templates for wild-type HIF-1 a and HIF-1 B, there was a dose- 
dependent inhibition of DNA-binding such that when pBluescript/HIF-1a3.2T7ANB 
was present in a 16-fold excess over the wild-type template pBluescript/HIF- 
30 1a3.2T7, HIF-1 DNA-binding was eliminated. 

These in vitro and in vivo experiments demonstrate that deletion of the basic 
domain of HIF-1a results in a protein that can block HIF-1 activity by inhibiting 
DNA binding. 
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SEQUENCE 


LOCATION 


COORDINATES 


1 9 CCC TACGTGCT gtctcacacagcctgtctga 


hEPO 3 1 -FS 


ww J/ "J U3 / 


ccgggcagctggcg TACGTGCT gcag 


mPFKL IVS-1 


+336/+36I 


ggggctgctgca GACGTGCG tgtg 
1 gtga GACGTGCG gcttccgtttcy 


hEPO 5'-FS 
hPGKl S'-FS 


"155/-178 ■ 
-172/-194 


ctgcc GACGTGCG ctccggag 


hPGKl S'-QT 


+31/+H 


gtgggagcccagcg GACGTGCG ggaa 


mLDHA 5 1 -FS 


-75/-50 


ggc CADGTGCG ccgcctgcgcctgcq 


hENOl 5*-FS 


-585/- 610 1 


ect CACGTGCG gggaccagggaccgt 


hALDA IVS-4 


+I25/+150 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: The Johns Hopkins University School of Medicine 
(ii) TITLE OF INVENTION: HYPOXIA INDUCIBLE FACTOR- 1 AND METHOD OF USE 
5 (iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 
10 (C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

<vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: PCT/US96/ 

(B) FILING DATE: 06-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Haile, Lisa A. 
25 (B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE /DOCKET NUMBER: 07265/053WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 

30 (2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 73 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GTGAAGACAT CGCGGGGACC GATTCACC ATG GAG GGC GCC GGC GGC GCG AAC 52 

Met Glu Gly Ala Gly Gly Ala Asn 
40 15 

GAC AAG AAA AAG ATA AGT TCT GAA CGT CGA AAA GAA AAG TCT CGA GAT 100 
Asp Lys Lys Lys lie Ser Ser Glu Arg Arg Lys Glu Lys Ser Arg Asp 
10 15 20 

GCA GCC AGA TCT CGG CGA AGT AAA GAA TCT GAA GTT TTT TAT GAG CTT 148 
45 Ala Ala Arg Ser Arg Arg Ser Lys Glu Ser Glu Val Phe Tyr Glu Leu 

25 30 35 40 
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GCT CAT CAG TTG CCA CTT CCA CAT AAT GTG AGT TCG CAT CTT GAT AAG l 96 
Ala H 1S Gin Leu Pro Leu Pro His Asn Val Ser Ser His Leu Asp Lys 
45 50 55 

GCC TCT GTG ATG AGG CTT ACC ATC AGC TAT TTG CGT GTG AGG AAA CTT 244 
Ala Ser Val Met Arg Leu Thr lie Ser Tyr Leu Arg Val Arg Lys Leu 
60 65 70 

CTG GAT OCT GOT GAT TTG GAT ATT GAA GAT GAC ATG AAA GCA CAG ATG 292 
Leu Asp Ala Gly Asp Leu Asp He Glu Asp Asp Met Lys Ala Gin Met 
75 8° 85 

AAT TGC TTT TAT TTG AAA GCC TTG GAT GGT TTT GTT ATG GTT CTC ACA 
Asn Cys Phe Tyr Leu Lys Ala Leu Asp Gly Phe Val Met Val Leu Thr 
90 95 100 



GAT GAT GGT GAC ATG ATT TAC ATT TCT GAT AAT GTG AAC AAA TAC ATG 
Asp Asp Gly Asp Met lie Tyr lie Ser Asp Asn Val Asn Lys Tyr Met 
105 110 US 120 



GGA TTA ACT CAG TTT GAA CTA ACT GGA CAC AGT GTG TTT GAT TTT ACT 
Gly Leu Thr Gin Phe Glu Leu Thr Gly His Ser Val Phe Asp Phe Thr 
125 130 135 

CAT CCA TGT GAC CAT GAG GAA ATG AGA GAA ATG CTT ACA CAC AGA AAT 
His Pro Cys Asp His Glu Glu Met Arg Glu Met Leu Thr His Arg Asn 



140 145 



150 



GGC CTT GTG AAA AAG GGT AAA GAA CAA AAC ACA CAG CGA AGC TTT TTT 
Gly Leu Val Lys Lys Gly Lys Glu Gin Asn Thr Gin Arg Ser Phe Phe 
155 160 16B 



AAG TCT GCA ACA TGG AAG GTA TTG CAC TGC ACA GGC CAC ATT CAC GTA 
Lys Ser Ala Thr Trp Lys Val Leu His Cys Thr Gly His He His Val 
185 190 195 200 



TAT GAT ACC AAC AGT AAC CAA CCT CAG TGT GGG TAT AAG AAA CCA CCT 
Tyr Asp Thr Asn Ser Asn Gin Pro Gin Cys Gly Tyr Lys Lys Pro Pro 
205 210 215 

ATG ACC TGC TTG GTG CTG ATT TGT GAA CCC ATT CCT CAC CCA TCA AAT 
Met Thr Cys Leu Val Leu He Cys Glu Pro He Pro His Pro Ser Asn 
220 225 2 3 0 

ATT GAA ATT CCT TTA GAT AGC AAG ACT TTC CTC AGT CGA CAC AGC CTG 
He Glu He Pro Leu Asp Ser Lys Thr Phe Leu Ser Arg His Ser Leu 
235 240 245 

GAT ATG AAA TTT TCT TAT TGT GAT GAA AGA ATT ACC GAA TTG ATG GGA 
Asp Met Lys Phe Ser Tyr Cys Asp Glu Arg He Thr Glu Leu Met Glv 
250 255 

TAT GAG CCA GAA GAA CTT TTA GGC CGC TCA ATT TAT GAA TAT TAT CAT 
Tyr Glu Pro Glu Glu Leu Leu Gly Arg Ser lie Tyr Glu Tyr Tyr His 
265 270 275 280 



340 



388 



436 



484 



532 



CTC AGA ATG AAG TGT ACC CTA ACT AGC CGA GGA AGA ACT ATG AAC ATA 580 
Leu Arg Met Lys Cys Thr Leu Thr Ser Arg Gly Arg Thr Met Asn l£ 
170 175 180 



628 



676 



724 



772 



820 



868 
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GCT TTG GAC TCT GAT CAT CTG ACC AAA ACT CAT CAT GAT ATG TTT ACT 916 
Ala Leu Asp Ser Asp His Leu Thr Lys Thr His His Asp Met Phe Thr 
285 290 295 

AAA GGA CAA GTC ACC ACA GGA CAG TAC AGG ATG CTT GCC AAA AGA GGT 964 
Lys Gly Gin Val Thr Thr Gly Gin Tyr Arg Met Leu Ala Lys Arg Gly 
300 305 310 

GGA TAT GTC TGG GTT GAA ACT CAA GCA ACT GTC ATA TAT AAC ACC AAG 1012 
Gly Tyr Val Trp Val Glu Thr Gin Ala Thr Val He Tyr Asn Thr Lys 
315 320 325 



10 AAT TCT CAA CCA CAG TGC ATT GTA TGT GTG AAT TAC GTT GTG AGT GGT 

Asn Ser Gin Pro Gin Cys He Val Cys Val Asn Tyr Val Val Ser Gly 
330 335 340 



40 TTT ACC ATG CCC CAG ATT CAG GAT CAG ACA CCT AGT CCT TCC GAT GGA 

Phe Thr Met Pro Gin He Gin Asp Gin Thr Pro Ser Pro Ser Asp Gly 
490 495 500 



1060 



ATT ATT CAG CAC GAC TTG ATT TTC TCC CTT CAA CAA ACA GAA TGT GTC 1108 
He He Gin His Asp Leu He Phe Ser Leu Gin Gin Thr Glu Cys Val 
15 345 350 355 3 60 

CTT AAA CCG GTT GAA TCT. TCA GAT ATG AAA ATG ACT CAG CTA TTC ACC 1156 
Leu Lys Pro Val Glu Ser Ser Asp Met Lys Met Thr Gin Leu Phe Thr 
365 370 375 



AAA GTT GAA TCA GAA GAT ACA AGT AGC CTC TTT GAC AAA CTT AAG AAG 1204 
Lys Val Glu Ser Glu Asp Thr Ser Ser Leu Phe Asp Lys Leu Lys Lys 
380 385 390 

GAA CCT GAT GCT TTA ACT TTG CTG GCC CCA GCC GCT GGA GAC ACA ATC 1252 
Glu Pro Asp Ala Leu Thr Leu Leu Ala Pro Ala Ala Gly Asp Thr He 
395 400 405 

ATA TCT TTA GAT TTT GGC AGC AAC GAC ACA GAA ACT GAT GAC CAG CAA 13 00 

He Ser Leu Asp Phe Gly Ser Asn Asp Thr Glu Thr Asp Asp Gin Gin 
410 415 420 

CTT GAG GAA GTA CCA TTA TAT AAT GAT GTA ATG CTC CCC TCA CCC AAC 134 6 

Leu Glu Glu Val Pro Leu Tyr Asn Asp Val Met Leu Pro Ser Pro Asn 
425 430 435 440 

GAA AAA TTA CAG AAT ATA AAT TTG GCA ATG TCT CCA TTA CCC ACC GCT 13 96 

Glu Lys Leu Gin Asn He Asn Leu Ala Met Ser Pro Leu Pro Thr Ala 
445 450 455 

GAA ACG CCA AAG CCA CTT CGA AGT AGT GCT GAC CCT GCA CTC AAT CAA 144 4 

35 Glu Thr Pro Lys Pro Leu Arg Ser Ser Ala Asp Pro Ala Leu Asn Gin 

460 465 470 

GAA GTT GCA TTA AAA TTA GAA CCA AAT CCA GAG TCA CTG GAA CTT TCT 1492 
Glu Val Ala Leu Lys Leu Glu Pro Asn Pro Glu Ser Leu Glu Leu Ser 
475 480 485 



1540 



AGC ACT AGA CAA AGT TCA CCT GAG CCT AAT AGT CCC AGT GAA TAT TGT 1588 
Ser Thr Arg Gin Ser Ser Pro Glu Pro Asn Ser Pro Ser Glu Tyr Cys 
45 505 510 515 520 
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TTT TAT GTG GAT AGT GAT ATG GTC AAT GAA TTC AAG TTG GAA TTG GTA 

Phe Tyr Val Asp Ser Asp Met Val Asn Glu Phe Lys Leu Glu Leu Val 
525 530 535 

GAA AAA CTT TTT GCT GAA GAC ACA GAA GCA AAG AAC CCA TTT TCT ACT 
Glu Lys Leu Phe Ala Glu Asp Thr Glu Ala Lys Asn Pro Phe Ser Thr 
540 545 550 

CAG GAC ACA GAT TTA GAC TTG GAG ATG TTA GCT CCC TAT ATC CCA ATG 
Gin Asp Thr Asp Leu Asp Leu Glu Met Leu Ala Pro Tyr He Pro Met 
555 560 565 

GAT GAT GAC TTC CAG TTA CGT TCC TTC GAT CAG TTG TCA CCA TTA GAA 
Asp Asp Asp Phe Gin Leu Arg Ser Phe Asp Gin Leu Ser Pro Leu Glu 
570 575 580 

AGC AGT TCC GCA AGC CCT GAA AGC GCA AGT CCT CAA AGC ACA GTT ACA 
Ser Ser Ser Ala Ser Pro Glu Ser Ala Ser Pro Gin Ser Thr Val Thr 
585 5 *° 595 600 

GTA TTC CAG CAG ACT CAA ATA CAA GAA CCT ACT GCT AAT GCC ACC ACT 
Val Phe Gin Gin Thr Gin He Gin Glu Pro Thr Ala Asn Ala Thr Thr 
605 610 615 

ACC ACT GCC ACC ACT GAT GAA TTA AAA ACA GTG ACA AAA GAC CGT ATG 
Thr Thr Ala Thr Thr Asp Glu Leu Lys Thr Val Thr Lys Asp Arg Met 
620 625 630 

GAA GAC ATT AAA ATA TTG ATT GCA TCT CCA TCT CCT ACC CAC ATA CAT 
Glu Asp He Lys He Leu He Ala Ser Pro Ser Pro Thr His He His 
635 640 645 

AAA GAA ACT ACT AGT GCC ACA TCA TCA CCA TAT AGA GAT ACT CAA AGT 
Lys Glu Thr Thr Ser Ala Thr Ser Ser Pro Tyr Arg Asp Thr Gin Ser 
*50 655 660 

CGG ACA GCC TCA CCA AAC AGA GCA GGA AAA GGA GTC ATA GAA CAG ACA 
Arg Thr Ala Ser Pro Asn Arg Ala Gly Lys Gly Val He Glu Gin Thr 
665 670 675 680 

GAA AAA TCT CAT CCA AGA AGC CCT AAC GTG TTA TCT GTC GCT TTG AGT 
Glu Lys Ser His Pro Arg Ser Pro Asn Val Leu Ser Val Ala Leu Ser 
685 690 695 

CAA AGA ACT ACA GTT CCT GAG GAA GAA CTA AAT CCA AAG ATA CTA GCT 
Gin Arg Thr Thr Val Pro Glu Glu Glu Leu Asn Pro Lys He Leu Ala 
700 70S 710 

TTG CAG AAT GCT CAG AGA AAG CGA AAA ATG GAA CAT GAT GGT TCA CTT 
Leu Gin Asn Ala Gin Arg Lys Arg Lys Met Glu His Asp Gly Ser Leu 
715 720 725 

TTT CAA GCA GTA GGA ATT GGA ACA TTA TTA CAG CAG CCA GAC GAT CAT 
Phe Gin Ala Val Gly He Gly Thr Leu Leu Gin Gin Pro Asp Asp His 
730 735 740 

GCA GCT ACT ACA TCA CTT TCT TGG AAA CGT GTA AAA GGA TGC AAA TCT 
Ala Ala Thr Thr Ser Leu Ser Trp Lys Arg Val Lys Gly Cys Lys Ser 
745 75 ° 755 7 60 



1636 



1684 



1732 



1780 



1828 



1876 



1924 



1972 



2020 



2068 



2116 



2164 



2212 



2260 



2308 
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AGT GAA CAG AAT GGA ATG GAG CAA AAG ACA ATT ATT TTA ATA CCC TCT 2356 
Ser Glu Gin Asn Gly Met Glu Gin Lys Thr He He Leu He Pro Ser 
765 770 775 

GAT TTA GCA TGT AGA CTG CTG GGG CAA TCA ATG GAT GAA AGT GGA TTA 2404 
Asp Leu Ala Cys Arg Leu Leu Gly Gin Ser Met Asp Glu Ser Gly Leu 
760 785 790 

CCA CAG CTG ACC AGT TAT GAT TGT GAA GTT AAT GCT CCT ATA CAA GGC 2452 
Pro Gin Leu Thr Ser Tyr Asp Cys Glu Val Asn Ala Pro He Gin Gly 
795 800 805 



10 AGC AGA AAC CTA CTG CAG GGT GAA GAA TTA CTC AGA GCT TTG GAT CAA 

Ser Arg Asn Leu Leu Gin Gly Glu Glu Leu Leu Arg Ala Leu Asp Gin 
810 815 820 



2500 



GTT AAC T GAGCTTTTTC TTAATTTCAT TCCTTTTTTT GGACACTGGT GGCTCACTAC 2557 
Val Asn 
15 825 

CTAAAGCAGT CTATTTATAT TTTCTACATC TAATTTTAGA AG CCTGG CTA CAATACTGCA 2617 

CAAACTTGGT TAGTT CAATT TTTGATCCCC TTTCTACTTA ATTTACATTA ATGCTCTTTT 2677 

TTAGTATGTT CTTTAATGCT GGATCACAGA CAGCTCATTT TCTCAGTTTT TTGGTATTTA 2737 

AACCATTGCA TTG CAG TAG C ATCATTAATT AAAAAATGCA CCTTTTTATT TATTTATTTT 2797 

20 TGG CTAGGGA GTTTATCCCT TTTTCGAATT ATTTTTAAGA AGATGCCAAT ATAATTTTTG 2857 

TAAGAAGGCA GTAACCTTTC ATCATGATCA TAGGCAGTTG AAAAATTTTT ACACCTTTTT 2917 

TTTCACAAAT TTTACATAAA TAATAATGCT TTGCCAGCAG TACGTGGTAG CCACAATTGC 2977 

ACAATATATT TTCTTAAAAA ATACCAGCAG TTACTCATGG AATATATTCT GCGTTTATAA 3037 

AAC TAGTTTT TAAGAAGAAA TTTTTTTTGG C CTATGAAAT TGTTAAACAA CTGGAACATG 3 097 

25 ACATTGTTAA TCATATAATA ATGATTCTTA AATG CTGTAT GGTTTATTAT TTAAATGGGT 3157 

AAAGCCATTT ACATAATATA GAAAGATATG CATATATCTA GAAGGTATGT GG CATTTATT 3217 

TGGATAAAAT TCTCAATTCA GAGAAATCAA ATCTGATGTT TCTATAGTCA CTTTGCCAGC 3277 

TCAAAAGAAA ACAATACCCT ATGTAGTTGT GGAAGTTTAT GCTAATATTG TGTAACTGAT 333 7 

ATTAAACCTA AATGTTCTGC CTACCCTGTT GGTATAAAGA TATTTTGAG C AGACTGTAAA 33 97 

30 CAAGAAAAAA AAAAAATCAT GCATTCTTAG CAAAATTGCC TAG TATGTTA ATTTGCTCAA 3457 

AATACAATGT TTGATTTTAT GCACTTTGTC GCTATTAACA TCCTTTTTTT CATGTAGATT 3517 

TCAATAATTG AGTAATTTTA GAAGCATTAT TTTAGGAATA TATAGTTGTC AAAAACAGTA 3577 
AATATCTTGT TTTTTCTATG TACATTGTAC AAATTTTTCA TTCCTTTTGC TCTTTGTGGT • 363 7 

TGGATCTAAC ACTAACTGTA TTGTTTTGTT ACATCAAATA AACATCTTCT GTGGAAAAAA 3697 

35 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 3736 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu Gly Ala Gly Gly Ala Asn Asp Lys Lys Lys He Ser Ser Glu 

Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser Arg Arg Ser Lys 
20 25 30 

Glu Ser Glu Val Phe Tyr Glu Leu Ala His Gin Leu Pro Leu Pro His 
35 40 45 

Asn Val Ser Ser His Leu Asp Lys Ala Ser Val Met Arg Leu Thr He 
50 55 go 

Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp He 
" 70 75 so 

Glu Asp Asp Met Lys Ala Gin Met Asn Cys Phe Tyr Leu Lys Ala Leu 
85 go 95 

Asp Gly Phe Val Met Val Leu Thr Asp Asp Gly Asp Met He Tyr He 
100 105 110 

Ser Asp Asn Val Asn Lys Tyr Met Gly Leu Thr Gin Phe Glu Leu Thr 
115 120 12 5 

25 Gly ?™ Ser Val Phe Asp phe Thr His Pro AS P His Glu Glu Met 

^° 130 135 14Q 

Arg Glu Met Leu Thr His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu 

150 i5 5 lg0 

Gin Asn Thr Gin Arg Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr 
165 170 175 

Ser Arg Gly Arg Thr Met Asn He Lys Ser Ala Thr Trp Lys Val Leu 
180 185 190 

His Cys Thr Gly His lie His Val Tyr Asp Thr Asn Ser Asn Gin Pro 
195 200 205 

Gin Cys Gly Tyr Lys Lys Pro Pro Met Thr Cys Leu Val Leu He Cvs 
210 215 220 



20 



30 



35 



40 



Glu Pro He Pro His Pro Ser Asn He Glu He Pro Leu Asp Ser Lys 
225 230 235 2 4 0 

Thr Phe Leu Ser Arg His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp 



245 250 



255 



Glu Arg He Thr Glu Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly 
260 265 270 
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Arg Ser He Tyr Glu Tyr Tyx His Ala Leu Asp Ser Asp His Leu Thr 
275 280 285 

Lys Thr His His Asp Met Phe Thr Lys Gly Gin Val Thr Thr Gly Gin 
290 295 300 

5 Tyr Arg Met Leu Ala Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gin 

305 310 315 320 

Ala Thr Val He Tyr Asn Thr Lys Asn Ser Gin Pro Gin Cys He Val 
325 330 335 

Cys Val Asn Tyr Val Val Ser Gly He He Gin His Asp Leu He Phe 
10 340 345 350 

Ser Leu Gin Gin Thr Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp 
355 360 365 

Met Lys Met Thr Gin Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser 
370 375 380 

15 Ser Leu Phe Asp Lys Leu Lys Lys Glu Pro Asp Ala Leu Thr Leu Leu 

385 390 395 400 

Ala Pro Ala Ala Gly Asp Thr He He Ser Leu Asp Phe Gly Ser Asn 
405 410 . 415 

Asp Thr Glu Thr Asp Asp Gin Gin Leu Glu Glu Val Pro Leu Tyr Asn 
20 420 425 430 

Asp Val Met Lieu Pro Ser Pro Asn Glu Lys Leu Gin Asn He Asn Leu 
435 440 445 

Ala Met Ser Pro Leu Pro Thr Ala Glu Thr Pro Lys Pro Leu Arg Ser 
450 455 460 

25 Ser Ala Asp Pro Ala Leu Asn Gin Glu Val Ala Leu Lys Leu Glu Pro 

465 470 475 480 

Asn Pro Glu Ser Leu Glu Leu Ser Phe Thr Met Pro Gin He Gin Asp 
485 490 495 

Gin Thr Pro Ser Pro Ser Asp Gly Ser Thr Arg Gin Ser Ser Pro Glu 
30 500 505 510 

Pro Asn Ser Pro Ser Glu Tyr Cys Phe Tyr Val Asp Ser Asp Met Val 
515 520 525 

Asn Glu Phe Lys Leu Glu Leu Val Glu Lys Leu Phe Ala Glu Asp Thr 
530 535 540 

35 Glu Ala Lys Asn Pro Phe Ser Thr Gin Asp Thr Asp Leu Asp Leu Glu 

545 550 555 560 

Met Leu Ala Pro Tyr He Pro Met Asp Asp Asp Phe Gin Leu Arg Ser 
565 570 575 

Phe Asp Gin Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser 
40 580 585 590 
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Ala Ser Pro Gin Ser Thr Val Thr Val Phe Gin Gin Thr Gin lie Gin 
595 600 605 

Glu Pro Thr Ala Asn Ala Thr Thr Thr Thr Ala Thr Thr Asp Glu Leu 
610 615 620 

5 Lys Thr Val Thr Lys Asp Arg Met Glu Asp lie Lys lie Leu He Ala 

625 «0 635 640 

Ser Pro Ser Pro Thr His He His Lys Glu Thr Thr Ser Ala Thr Ser 
645 650 655 

Ser Pro Tyr Arg Asp Thr Gin Ser Arg Thr Ala Ser Pro Asn Arg Ala 
1U 660 665 670 

Gly Lys Gly Val He Glu Gin Thr Glu Lys Ser His Pro Arg Ser Pro 
675 680 685 



15 



20 



25 



30 



40 



Asn Val Leu Ser Val Ala Leu Ser Gin Arg Thr Thr Val Pro Glu Glu 
690 695 700 

Glu Leu Asn Pro Lys He Leu Ala Leu Gin Asn Ala Gin Arg Lys Arq 
705 710 715 ^ 72 o 

Lys Met Glu His Asp Gly Ser Leu Phe Gin Ala Val Gly He Gly Thr 
7 25 730 735 

Leu Leu Gin Gin Pro Asp Asp His Ala Ala Thr Thr Ser Leu Ser Trp 
740 745 7 50 

Lys Arg Val Lys Gly Cys Lys Ser Ser Glu Gin Asn Gly Met Glu Gin 
755 760 765 

Lys Thr He He Leu He Pro Ser Asp Leu Ala Cys Arg Leu Leu Gly 
770 775 780 

Gin Ser Met Asp Glu Ser Gly Leu Pro Gin Leu Thr Ser Tyr Asn Cvs 
785 790 795 P eoo 

Glu Val Asn Ala Pro He Gin Gly Ser Arg Asn Leu Leu Gin Gly Glu 
80S 810 

Glu Leu Leu Arg Ala Leu Asp Gin Val Asn 
820 825 

(2) INFORMATION FOR SEQ ID NO: 3: 



815 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS : not relevant 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

Met Glu Gly He Ala Gly Ser Arg Arg Ser Lys Glu Ser Glu Val Phe 
1 5 io 



15 
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Tyr Glu Leu Ala His Gin Leu Pro Leu Pro His Asn Val Ser Ser His 
20 25 30 

Leu Asp Lys Ala Ser Val Met Arg Leu Thr lie Ser Tyr Leu Arg Val 
35 40 45 

5 Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp lie Glu Asp Asp Met Lys 

50 55 60 

Ala Gin Met Asn Cys Phe Tyr Leu Lys Ala Leu Asp Gly Phe Val Met 
65 70 75 80 

Val Leu Thr Asp Asp Gly Asp Met lie Tyr lie Ser Asp Asn Val Asn 
10 85 90 95 

Lys Tyr Met Gly Leu Thr Gin Phe Glu Leu Thr Gly His Ser Val Phe 
100 105 110 

Asp Phe Thr His Pro Cys Asp His Glu Glu Met Arg Glu Met Leu Thr 
115 120 125 

15 His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu Gin Asn Thr Gin Arg 

130 135 140 

Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr Ser Arg Gly Arg Thr 
145 150 155 160 

Met Asn He Lys Ser Ala Thr Trp Lys Val Leu His Cys Thr Gly His 
20 165 170 175 

He His Val Tyr Asp Thr Asn Ser Asn Gin Pro Gin Cys Gly Tyr Lys 
180 185 190 

Lys Pro Pro Met Thr Cys Leu Val Leu He Cys Glu Pro He Pro His 
195 200 205 

25 Pro Ser Asn He Glu He Pro Leu Asp Ser Lys Thr Phe Leu Ser Arg 

210 215 220 

His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp Glu Arg He Thr Glu 
225 230 235 240 

Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly Arg Ser He Tyr Glu 
30 245 250 255 

Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr Lys Thr His His Asp 
260 265 270 

Met Phe Thr Lys Gly Gin Val Thr Thr Gly Gin Tyr Arg Met Leu Ala 
275 280 285 

35 Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gin Ala Thr Val He Tyr 

290 295 300 

Asn Thr Lys Asn Ser Gin Pro Gin Cys He Val Cys Val Asn Tyr Val 
305 310 315 320 

Val Ser Gly He He Gin His Asp Leu He Phe Ser Leu Gin Gin Thr 
40 325 330 335 
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Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp Met Lys Met Thr Gin 



340 345 



350 



Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser Ser Leu Phe Asp Lys 
355 360 3g5 

Leu Lys He Gin Thr 
370 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Gly He Ala Gly Ser Arg Arg Ser Lys Glu Ser Glu Val 



10 



Phe 



15 



Tyr Glu Leu Ala His Gin Leu Pro Leu Pro His Asn Val Ser Ser His 
20 25 30 

Leu Asp Lys Ala Ser Val Met Arg Leu Thr He Ser Tyr Leu Arg Val 
35 40 45 

Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp He Glu Asp Asp Met Lys 



60 



Ala Gin Met Asn Cys Phe Tyr Leu Lys Ala Leu Asp Gly Phe Val Met 

70 75 eo 

Val Leu Thr Asp Asp Gly Asp Met He Tyr He Ser Asp Asn Val Asn 
85 90 95 

Lys Tyr Met Gly Leu Thr Gin Phe Glu Leu Thr Gly His Ser Val Phe 
100 105 110 

Asp Phe Thr His Pro Cys Asp His Glu Glu Met Arg Glu Met Leu Thr 
115 120- 125 

His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu Gin Asn Thr Gin Arg 

135 140 



Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr Ser Arg Gly Arg Thr 

150 155 160 

Met Asn He Lys Ser Ala Thr Trp Lys Val Leu His Cys Thr Gly His 
165 170 175 

He His Val Tyr Asp Thr Asn Ser Asn Gin Pro Gin Cys Gly Tyr Lys 
180 185 190 

4Q Lys Pro Pro Met Thr Cys Leu Val Leu He Cys Glu Pro He Pro His 

195 9nn ~ ~ - 



200 20S 
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Pro Ser Asn lie Glu lie Pro Leu Asp Ser Lys Thr Phe Leu Ser Arg 
210 215 220 

His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp Glu Arg He Thr Glu 
225 230 235 240 

5 Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly Arg Ser He Tyr Glu 

245 250 255 

Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr Lys Thr His His Asp 
260 265 270 

Met Phe Thr Lys Gly Gin Val Thr Thr Gly Gin Tyr Arg Met Leu Ala 
10 275 280 285 

Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gin Ala Thr Val He Tyr 
290 295 300 

Asn Thr Lys Asn Ser Gin Pro Gin Cys He Val Cys Val Asn Tyr Val 
305 310 315 320 

15 Val Ser Gly He He Gin His Asp Leu He Phe Ser Leu Gin Gin Thr 

325 330 335 

Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp Met Lys Met Thr Gin 
340 345 350 

Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser Ser Leu Phe Asp Lys 
20 355 360 365 

Leu Lys Lys Glu Pro Asp Ala Leu Thr Leu Leu Ala Pro Ala Ala Gly 
370 375 380 

Asp Thr He He Ser Leu Asp Phe Gly Ser Asn Asp Thr Glu Thr Asp 
385 390 395 400 

25 Asp Gin Gin Leu Glu Glu Val Pro Leu Tyr Asn Asp Val Met Leu Pro 

405 410 415 

Ser Pro Asn Glu Lys Leu Gin Asn He Asn Leu Ala Met Ser Pro Leu 
420 425 430 

Pro Thr Ala Glu Thr Pro Lys Pro Leu Arg Ser Ser Ala Asp Pro Ala 
30 435 440 445 

Leu Asn Gin Glu Val Ala Leu Lys Leu Glu Pro Asn Pro Glu Ser Leu 
450 455 460 

Glu Leu Ser Phe Thr Met Pro Gin He Gin Asp Gin Thr Pro Ser Pro 
465 470 475 480 

35 Ser Asp Gly Ser Thr Arg Gin Ser Ser Pro Glu Pro Asn Ser Pro Ser 

485 490 495 

Glu Tyr Cys Phe Tyr Val Asp Ser Asp Met Val Asn Glu Phe Lys Leu 
500 505 510 

Glu Leu Val Glu Lys Leu Phe Ala Glu Asp Thr Glu Ala Lys Asn Pro 
40 515 520 525 
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Phe Ser Thr Gin Asp Thr Asp Leu Asp Leu Glu Met Leu Ala Pro Tvr 
530 535 540 

lie Pro Met Asp Asp Asp Phe Gin Leu Arg Ser Phe Asp Gin Leu Ser 
545 550 555 560 

Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser Ala Ser Pro Gin Ser 
565 570 5?5 

Thr Val Thr Val Phe Gin Gin Thr Gin He Gin Glu Pro Thr Ala Asn 
580 585 590 

Ala Thr Thr Thr Thr Ala Thr Thr Asp Glu Leu Lys Thr Val Thr Lys 
595 600 6 o5 

Asp Arg Met Glu Asp He Lys He Leu He Ala Ser Pro Ser Pro Thr 
610 615 620 

His He His Lys Glu Thr Thr Ser Ala Thr Ser Ser Pro Tyr Arq Aso 
625 630 635 3 640 

Thr Gin Ser Arg Thr Ala Ser Pro Asn Arg Ala Gly Lys Gly Val He 
645 650 655 

Glu Gin Thr Glu Lys Ser His Pro Arg Ser Pro Asn Val Leu Ser Val 
660 665 670 

on Ala Leu Ser Gln ^9 Thr Thr Val Pro Glu Glu Glu Leu Asn Pro Lys 

ZU 675 680 685 

He Leu Ala Leu Gin Asn Ala Gin Arg Lys Arg Lys Met Glu His Asp 
690 695 700 



15 



25 



Gly Ser Leu Phe Gin Ala Val Gly He Gly Thr Leu Leu Gin Gin Pro 
705 710 715 720 

Asp Asp His Ala Ala Thr Thr Ser Leu Ser Trp Lys Arg Val Lys Gly 
72 5 730 735 

Cys Lys Ser Ser Glu Gin Asn Gly Met Glu Gin Lys Thr He He Leu 
740 745 7 5o 

30 Ile Pro Ser As P Leu Ala Cys Arg Leu Leu Gly Gin Ser Met Asp Glu 

755 760 7 65 

Ser Gly Leu Pro Gin Leu Thr Ser Tyr Asp Cys Glu Val Asn Ala Pro 
770 775 780 

He Gin Gly Ser Arg Asn Leu Leu Gin Gly Glu Glu Leu Leu Arc Al 
785 790 795 



35 



40 



a 

800 



Leu Asp Gin Val Asn 
805 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GATCGCCCTA CGTG CTGTCT CA 
(-2) INFORMATION FOR SEQ ID NO: 6: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GATCGCCCTA AAAG CTGTCT CA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

20 (ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 15 and 27 is inosine . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATCGGATCCA TCACNGARCT SATGGGNTAT A 

(2) INFORMATION FOR SEQ ID NO: 8: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(D) OTHER INFORMATION: N is inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATTAAG CMTG GTSAGGTGGT CNSWGTC 

35 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ATTAAGCTTG CATGGTAGTA YTCATAGAT 
(2) INFORMATION FOR SEQ ID NO: 10: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 <ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(D) OTHER INFORMATION: N is inosine . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATAAAGCTTG TSTAYGTSTC NGAYTCGG 

15 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



29 



28 



(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(D) OTHER INFORMATION: N is inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

25 ATCGAATTCY TCNGACTGNG GCTGGTT 2? 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
TACGGATCCG CCATGGCGGC GACTACTGA 29 
35 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGCCAGGGCA CTACAGGTGG GTACC 
(2) INFORMATION FOR SEQ ID NO : 14 : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 <ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GTTCCCCGCA AGGACTTCAT GTGAG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

He Thr Glu Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly Arg 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa He He Leu He Pro Ser Asp Leu Ala Xaa Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ser lie Tyr Glu Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr Lys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 18: 

5 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
<D> TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ser Phe Phe Leu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

^ 5 < i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



30 



35 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCCRCCATGG 

(2) INFORMATION FOR SEQ ID NO:20: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTCACCATGG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



10 



10 
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Val Val Tyr Val Ser Asp Ser Val Thr Pro Val Leu Asn Gin Pro Gin 
1 5 10 15 

Ser Glu 



5 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 
(BJ TYPE: amino acid 
(C) STRANDEDNESS : not relevant 
10 {D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Thr Ser Gin Phe Gly Val Gly Ser Phe Gin Thr Pro Ser Ser Phe Ser 
1 5 10 15 

15 Ser Met Xaa Leu Pro Gly Ala Pro Thr Ala Ser Pro Gly Ala Ala Ala 

20 25 30 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 23: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

CACGTG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

BACGTGC 

(2) INFORMATION FOR SEQ ID NO: 25: 



BNSDOCID: <WO 9639426A1_IA> 



WO 96/39426 



PCT/US96/10251 



-66- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

<D) OTHER INFORMATION: N is inosine . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
10 TNGNG CGTGM SA 

12 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
UUAUUUAWW 

-0 (2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATAGGATCCT CAGGTCAGCT GGCACCCAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCAAAGCTTC TATTCTGAAA AGGGGGG 
(2) INFORMATION FOR SEQ ID NO: 29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
RWACGTG 

(2) INFORMATION FOR SEQ ID NO: 30: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TACGTGCT 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GACGTGCG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
35 CACGTGCG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



BNSDOCID: <WO_9639426A1_IA> 



WO 96/39426 



PCT/US96/10251 



-68- 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
BACGTGCK 

(2) INFORMATION FOR SEQ ID NO: 34: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CACGTGCT 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Glu Gly He Ala Gly Ala Asn Asp Lys Lys Lys He Ser Ser Glu 
15 io 15 

Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser Arg Arg 
20 25 30 
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Claims 

1 . Purified human HIF-1 . 

2. The human HIF-1a polypeptide encoded by 

(a) the DNA sequence set out in Fig. 10 (SEQ ID NO:1) or its 
5 complementary strand; and 

(b) DNA sequences which hybridize under stringent conditions to the 
DNA sequences defined in (a). 

3. An isolated nucleotide sequence encoding the human HIF-1a 
polypeptide. 

4. The isolated nucleotide sequence of claim 3 selected from the group 
consisting of: 

(a) SEQ ID NO:1; 

(b) nucleic acid sequences complementary to SEQ ID NO:1; 

(c) fragments of (a) or (b) that are at least 15 bases in length and that will 
selectively hybridize to nucleotides which encode the HIF-1a polypeptide of SEQ 
ID NO:1, under stringent conditions. 

5. The nucleotide of claim 3, wherein the nucleotide is isolated from a 
mammalian cell. 

6. The nucleotide of claim 5, wherein the mammalian cell is a human 
20 cell. 

7. An expression vector including the nucleotide of claim 3. 

8. The vector of claim 7, wherein the vector is a plasmid. 

9. The vector of claim 7, wherein the vector is a virus. 

10. A host cell stably transformed with the vector of claim 7. 



10 



15 
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1 1 . The host cell of claim 10, wherein the cell is prokaryotic. 

12. The host cell of claim 10, wherein the cell is eukaryotic. 

13. A purified antibody that binds to HIF-1 or to the HIF-1 a polypeptide or 
immunoreactive fragments thereof. 

5 14. The antibody of claim 13, wherein the antibody is polyclonal. 

15. The antibody of claim 13, wherein the antibody is monoclonal. 

16. A purified and isolated nucleotide sequence encoding a polypeptide 
having an amino acid sequence sufficiently duplicative of HIF-1 a to allow 
possession of the biological activities of promoting the synthesis of erythropoietin 

10 (EPO), aldolase A (ALDA), phosphoglycerate kinase 1 (PGK1), pyruvate kinase M 

(PKM) and vascular endothelial growth factor (VEGF) in Hep3B cells. 

17. A human HIF-1 a variant polypeptide which dimerizes with an HIF-1 p 
isoform wherein at least one of the amino acids of SEQ ID NO:2 is replaced by 
another amino acid. 

15 18. An isolated nucleotide sequence encoding the human variant HIF-1 a 

polypeptide having the sequence of SEQ ID NO:4. 

19. A method of detecting HIF-1 a comprising contacting a specimen of a 
subject with a reagent that binds HIF-1 a and detecting binding of the reagent to 
HIF-1a. 

20 20 - The method of claim 19 wherein the reagent is a nucleotide sequence 

complementary to SEQ ID NO:1 or a portion thereof. 

21 . The method of claim 18 wherein the reagent is an antibody specific for 
HIF-1 a. 
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22. A method for enhancing expression of a structural genetic sequence 
whose regulatory region contains an HIF-1 binding site, comprising administering 
a therapeutically effective amount of a nucleotide sequence encoding HIF-1 a, 
whereby expression of the structural genetic sequence is enhanced. 

5 23. The method of claim 22, wherein the structural genetic sequence 

encodes EPO. 

24. The method of claim 22, wherein the structural genetic sequence 
encodes VEGF. 

25. The method of claim 22, wherein the structural genetic sequence 
1 0 encodes a glycolytic enzyme. 

26. A method of treating hypoxia-related tissue damage in a subject in 
need thereof, comprising administering a therapeutically effective amount of a 
nucleotide sequence encoding HIF-1 a, wherein tissue damage is substantially 
inhibited. 
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27. A method of treating hypoxia-related tissue damage in a subject 
need thereof, comprising introducing a nucleotide sequence of claim 3 into cells of 
the subject, wherein a therapeutically effective amount of HIF-1a is expressed in 
the subject, wherein tissue damage is substantially inhibited. 

5 28. A method for inhibiting expression of a structural genetic sequence 

whose regulatory region contains an HIF-1 binding site, comprising administering 
a therapeutically effective amount of an inhibitory nucleotide sequence, whereby 
expression of the structural genetic sequence is inhibited. 

29. The method of claim 28 wherein the inhibitory nucleotide sequence 
10 hybridizes to an HIF-1 a encoding nucleotide sequence. 

30. The method of claim 29, wherein the HIF-1 a encoding nucleotide 
sequence is RNA. 

31 . The method of claim 29, wherein the HIF-1 a encoding nucleotide 
sequence is DNA. 

15 32 The method of claim 28 wherein the inhibitory nucleotide sequence 

encodes an HIF-1 a variant polypeptide. 

33. A pharmaceutical composition comprising a pharmaceutically 
acceptable carrier admixed with a therapeutically effective amount of HIF-1 

34. A pharmaceutical composition comprising a nucleotide sequence 
20 encoding HIF-1 a in a pharmaceutically acceptable carrier. 

35. A pharmaceutical composition comprising an HIF-1 a inhibitory 
nucleotide sequence in a pharmaceutically acceptable carrier. 
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FIG. 10-1 

1 
i 

62 AAG A3A ACT TCP GAA OCT OGA AAA GAA AAG 

12 lys ile ser ser glu arg arg lys glu lys 

182 TOG CAT CTT GAT AAG GOC TCP GIG AUG AGG 

52 ser his leu asp lys ala ser val net arg 

302 ™ TIG AAA GOC TIG GAT OCT TTT GIT AUG 

92 tyr leu lys ala leu asp gly phe val met 

422 GIG TTT GAT TTT ACT CAT OCA TGT GAC CAT 

132 val phe asp phe thr his pro cys asp his 

542 AAG TGT AOC CIA ACT AGC OGA GGA AGA ACT 

172 lys cys thr leu thr ser arg gly arg thr 

662 TAT AAG AAA OCA OCT AUG AOC TGC TIG GIG 

212 tyr lys lys pro pro met thr cys leu val 

782 TTT TCT TAT TGT GAT GAA AGA ATT AOC GAA 

252 phe ser tyr cys asp glu arg ile thr alxi 

902 CAT GAT ATG TTT ACT AAA GGA CAA GIC AOC 

292 his asp met phe thr lys gly gin val thr 

1022 OCA CAG TGC ATT GUV TGT GIG AAT TAC GIT 

332 pro gin cys ile val cys val asn tyr val 

1142 ACT CAG CTA TTC AOC AAA GIT GAA TCA GAA 

372 thr gin leu phe thr lys val glu ser glu 

1262 GAT TTT GQC AGC AAC GAC ACA GAA ACT GAT 

412 asp phe gly ser asn asp thr glu thr asp 

1382 OCA TIA CCC ACC GOT GAA AOG OCA AAG OCA 

452 pro leu pro thr ala glu thr pro lys pro 
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TCT OGA GAT GCA GQC AGA TCP COG OGA ACT 
ser arg asp ala a la arg ser arg arg ser 
CTT ADC ATC AGC TAT TIG OCT GIG AGS AAA 
leu thr ile ser tyr leu arg val arg lys 
GIT CIC ACA GAT GAT OCT GAC AUG ATT TAC 
val leu thr asp asp gly asp net ile tyr 
GAG GAA AIG AGA GAA AUG CTT ACA CAC AGA 
glu glu met arg glu met leu thr his arg 
AIG AAC ATA AAG TCT GCA ACA TOG AAG GIA 
met asn ile lys ser ala thr trp lys val 
CIG ATT TCT GAA COC ATT CCT CAC CCA TGA 
leu ile cys glu pro ile pro his pro ser 
TIG AIG QGA TAT GAG OCA GAA GAA CTT TTA 
leu met alv tvr alu pro crlu alu leu Iphi 
ACA QGA CAG TAC AQG AUG CTT GCC AAA AGA 
thr gly gin tyr arg met leu ala lys arg 
GIG ACT OCT ATT ATT CAG CAC GAC TIG ATT 
val ser gly ile ile gin his asp leu ile 
GAT ACA ACT AGC CIC TTT GAC AAA CTT AAG 
asp thr ser ser leu phe asp lys leu lys 
GAC CAG CAA CTT GAG GAA GIA OCA TTA TAT 
asp gin gin leu glu glu val pro leu tyr 
CTT OGA ACT ACT GOT GAC OCT GCA CIC AAT 
leu arg ser ser ala asp pro ala leu asn 
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FIG. 10-3 

CTGA&SACMOGOaC^^ AIG 

met 

AAA. GAA TCT GAA GIT TIT TAT GAG CTT GCT 
lys glu ser glu val phe tyr glu leu ala 
CTT CIG GAT GCT GCT GAT TIG GAT ATT GAA 
leu leu asp ala gly asp leu asp ile glu 
ATT TCT GAT AAT GIG AAC AAA TAG AIG GGA 
ile ser asp asn val asn lys tyr met gly 
AAT QQC CTT GIG AAA AAG OCT AAA GAA CAA 
asn gly leu val lys lys gly lys glu gin 
TTG CAC TGC ACA GGC CAC ATT GAG GIA TAT 
leu his cys thr gly his ile his val tyr 
AAT ATT GAA ATT OCT TTA GAT AGC AAG ACT 
asn ile glu ile pro leu asp ser lys thr 
GGC GGC TCA ATT TAT GAA TAT TAT CAT GCT 
qlv ara ser ile tvr alu t-vr tvr his al* 
GGT GGA TAT GIG TOG GIT GAA ACT CAA QCA 
gly gly tyr val trp val glu thr gin ala 
TTG TGC CTT CAA CAA ACA GAA TCT GIG CTT 
phe ser leu gin gin thr glu cys val leu 
AAG GAA OCT GAT OCT TTA ACT TTG CIG GGC 
lys glu pro asp ala leu thr leu leu ala 
AAT GAT GTA. AIG CIG GGC TCA OCC AAC GAA 
asn asp val met leu pro ser pro asn glu 
CAA GAA GIT GCA TTA AAA TTA. GAA OCA AAT 
gin glu val ala leu lys leu glu pro asn 
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FIG. 10-4 

GAG QGC GOC QGC QGC GOS AAC GAC AAG AAA 
glu gly ala gly gly ala asn asp lys lys 
CAT CAG TIG CCA CTT CCA CAT AAT GIG ACT 
his glxi leu pro leu pro his asn val ser 
GAT GAC AUG AAA GCA CAG ATC AAT TGC ITT 
asp asp met lys ala gin met asn cys pfoe 
T3A ACT CAG TIT GAA CIA ACT GGA CAC ACT 
leu t±ir gin phe glu leu thr gly his ser 
AAC ACA CAG GGA AGC TIT TIT CIC AGA ATC 
asn thr gin arg ser nhp p he len am net 
GAT ACC AAC ACT AAC CAA OCT CAG TCT G3G 
asp t±ir asn ser asn gin pro gin cys gly 
TIC CIC ACT GGA CAC AGC CIG GAT ATS AAA 
phe leu ser arg his ser leu asp net lys 
TIG GAC TCT GAT CAT CIG ACC AAA ACT CAT 
leu asp ser asn his le*i rtrr lyp thr his 
ACT GIC A3A TAT AAC ACC AAG AAT TCT CAA 
thr val ile tyr asn thr lys asn ser gin 
AAA CCG GIT GAA TCT TCA GAT AUG AAA AUG 
lys pro val glu ser ser asp met lys met 
CCA GOC OCT GGA GAC ACA ATC ATA TCT TEA 
pro ala ala gly asp thr ile ile ser leu 
AAA TEA CAG AAT ATA AAT TIG OCA AUG TCT 
lys leu gin asn ile asn leu ala net ser 
OCA GAG TCA CIG GAA CTT TCT TIT ADC ATC 
pro glu ser leu glu leu ser phe thr met 
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1502 OCC CAG ATT CAG GAT CAG ACA OCT ACT OCT 

492 pro gin ile gin asp gin thr pro ser pro 

1622 AAG TIG GAA TIG GEA GAA AAA. CIT TIT OCT 

532 lys lea glu leu val glu lys leu phe ala 

1742 TIC CAG TEA QGT IOC TIC GAT CAG TIG 1CA 

572 phe gin leu arg ser phe asp gin leu ser 

1862 OCT AAT GCC AOC ACT AOC ACT QOC AOC ACT 

612 ala asn ala thr thr thr thr ala thr thr 

1982 ACT ACT QOC ACA TCA 1CA OCA TAT AGA GAT 

652 thr ser ala thr ser ser pro tyr arg asp 

2102 TCT GIC GOT TIG ACT CAA AGA ACT ACA GIT 

692 ser val ala leu ser gin arg thr thr val 

2222 GEA QGA. ATT QGA ACA TTA TEA CAG CAG OCA 

732 val gly ile gly thr leu leu gin gin pro 

2342 ATT TEA ATA O0C TCT GAT TTA OCA TGT AGA 

772 ile leu ile pro ser asp leu q lf^ rv^ aTT T 

2462 Cm CIG CAG OCT GAA GAA TEA CIC AGA OCT 

812 leu leu gin gly glu glu leu leu arg ala 

2605 CIACAATACIQCACAAACTra 

2923 TITEACAIAAATAATAATGC1T1GQ 

3082 CTGGAACATGACATIGITAATCATATAA^ 

3241 TCT3ATCTTICIA3?^CTCAC 

3400 AAAAICATQCATICITAGCAAAATIGQCTAf 

3559 CAGEAAATAlICTIGITITriCK 

FIG. 10-5 
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ICC GAT GGA AGC ACT CAA ACT TCA OCT 
ser asp gly ser thr arg gin ser ser pro 
GAA GAC ACA GAA GCA AAG AAC OCA TTT 1CT 
glu asp thr glu ala lys asn pro phe ser 
CCA TTA GAA AGC ACT TOC GCA AGC CCT GAA 
pro leu glu ser ser ser ala ser pro glu 
GAT GAA TEA AAA ACA GIG ACA AAA GAC CGT 
asp glu leu lys thr val thr lys asp arg 
ACT CAA ACT COG ACA QOC TCA OCA AAC AGA 
thr gin ser arg thr ala ser pro asn arg 
OCT GAG GAA GAA CIA AAT CCA AAG ATA CIA 
pro glu glu glu leu asn pro lys ile leu 
GAC GAT CAT OCA GCT ACT ACA TCA CTT TCT 
asp asp his ala ala thr thr ser leu ser 
CIG CIG QGG CAA TCA AUG GAT GAA ACT GGA 
leu leu gly gin ser net asp glu ser gly 
TIG GAT CAA GIT AAC TGA GCITITICTIAATTT 
leu asp gin val asn OPA 

QGACTTIATCCCITITiaSAA l 

^GCX^O^ATIQCACAAIATA ^ 

AAATGCTGIAT3GrriATIA^ 




TATCCIAATATTG 
AT3CACITIGICG 



FIG. 10-6 
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GAG OCT AAT ACT CDC ACT GAA TAT TCT TTT 
glu pro asn ser pro ser glu tyr cys phe 
ACT CAG GAC ACA GAT TTA GAC TTG GAG AUG 
t±ir gin asp thr asp leu asp leu glu met 
AGC GCA ACT OCT CAA AGC ACA GIT ACA GEA 
ser ala ser pro gin ser thr val thr val 
AUG GAA GAC ATT AAA ATA TTG ATT GCA TCT 
met glu asp ile lys ile leu ile ala ser 
OCA QGA AAA QGA GIC ATA GAA CAG ACA GAA 
ala gly lys gly val ile glu gin thr glu 
GOT TTG CAG AAT OCT CAG AGA AAG OGA AAA 
ala leu gin asn ala gin arg lys arg lys 
TOG AAA OCT GTA AAA QGA TQC AAA TCT ACT 
trp lys arg val lys gly cys lys ser ser 
TEA OCA CAG CIG AOC ACT TAT GAT TCT GAA 
leu pro gin leu thr ser tyr asp cys glu 
CATICCTITITIT3GAC 

TGITCITTAATQCT3GATCAC^ 

AATAII^ATITITCTAAGAA 

AGTTACTCAIQGAAIATATICTO 

TACATAAIIAIWjAAAGATATGCAIZ^ 

TGIAACTGATATTAAACXITAAA11CT 

CIATTAACATCCTITTTT^ 

ACTCTATIGITITCTI1ACATC 

FIG. 10-7 
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TAT GIG GAT ACT GAT AUG GIC AAT GAA TIC 
tyr val asp ser asp met val asn glu phe 
Tm OCT CDC TAT ATC CCA AUG GAT GAT GAC 
leu ala pro tyr ile pro met asp asp asp 
TTC CAG CAG ACT CAA ATA CAA GAA OCT ACT 
phe gin gin thr gin ile gin glu pro thr 
CCA TCT OCT ACC CAC ATA CAT AAA GAA ACT 
pro ser pro thr his ile his lys glu thr 
AAA TCT CAT CCA AGA AGC OCT AAC GIG TEA 
lys ser his pro 'arg ser pro asn val leu 
A3G GAA CAT GAT OCT TCA CIT TTT CAA GCA 
met glu his asp gly ser leu phe gin ^Ig 
GAA CAG AAT GGA AIG GAG CAA AAG ACA ATT 
glu gin asn gly met glu gin lys thr ilg 
GIT AAT OCT CCT ATA CAA QQC AGC AGA AAC 
val asn ala pro ile gin gly ser arg asn 
AGICTATTIATATTITCT 

TITITCGTATITAAACOV^^ 

CATAQGCAGTIGAAAAATIT^^ 

TIAAGAAGAAATITriTriQGCCE^^ 

GGCATTTATTIGGATAAAATICICAA!^^ 

GIATAAAGATATTITGAGCM 

TAAITITAGAAGCATTAIITr^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIG. 10-8 
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E 3-UNTRANSLATED SEQUENCES 

2568 CU AUUUA UA 
2656 UA AUUUA CA 
2731 GU AUUUA AA 
2781 UU AUUUA UU 
2785 UU AUUUA UU 
3138 UU AUUUA AA 
3156 CC AUUUA CA 
3203 GC AUUUA UU 
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FIG. UD 
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